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PREFATORY NOTE. 


lo eacli set of Lectures delivered before the Institute of 
Actuaries^ when published in book form^ there has generally 
been prefixed a short preface^ or introduction, Yvritten by the 
President of the Institute then in office. This course, 
admirable in itself, cannot well be followed on the present 
occasion, having regard to the fact that Mr. Hardy has, in the 
interval between the delivery of the Lectures and their 
publication, himself been elected to the Presidenti^ chair. 
It.ha§ therefore devolved upon us, as Honorary Secretaries of 
*%^^nstitute, to insert this foreword in explanation of a 
seeming omission, and to express therein the confidence of the 
Council that the Lectures will be found to be of the greatest 
interest and value to the profession, which already owes so 
deep a debt of gratitude to their author. 


J. E. E. 
W. P. R 


PEEFACE. 


iHE object of the following Lectures was to deal with the 
theoretical considerations that should govern the selection 
a,nd treatment of such statistics as form the basis of the 
various tables of mortality^ sickness^ secession^ marriage, 
superannuation, etc., which are of use to the Actuary. It 
should be noted that in neatly all cases where mortality 
tables are specially referred to what is said may be extended 
to other types of statistics, though, to avoid repetition, that is 
not always pointed out. 

Some apology is required for the long delay in the publica- 
tion of the Lectures. It was intended subsequently to their 
delivery, to expand them into something like a complete 
treatment of the subject (from the theoretical point of view), 
•and to add a sufficient series of examples to illustrate the 
various’ points of theory. Unfortunately I have not found 
time to carry out this intention, but as regards that part of 
the subject dealing with the use of the Pearsonian Types of 
Prequency Curves in. Statistics this has been rendered un- 
necessary by the appearance of Mr, Elderton^s admirable 
book upon Frequency Curves and Correlation published 
by the Institute of Actuaries in 1906. 

A few additions have, however, been made to the 
Lectures as originally delivered, and where these appeared 
to interfere with the continuity of the text they have been 
relegated to notes placed at the end of the Lectures. 

I have very specially to thank Mr. G. J. Lidstonk, F.I.A., 
for several valuable suggestions, in particular for the con- 
tribution of Notes, and for’ assistance in preparing the 
lectures for the Printers; and also Dr. Jamb:s Buchanan, 
M.A., F.I.A., P.F.A., for having kindly revised T}he proofs 
and checked the algebra and numerical work. 


G. P. H. 


The Theory of the 

Construction of Tables of Mortality 

AND OF 

Similar Statistical Tables in use by the Actuaiyc 

BY 

G. F. HARDY, F.I.A. 

FIKST LECTUEE. 


When the Council asked me to deliver a series of lectures 
upon some subject connected with Part III of the Institute 
Examination I selected the construction of mortality and 
similar statistical tables^ mainly because it seemed to me to lie 
at the basis of our work. Actuarial science, in the modern 
sense of the term, had its origin in the collection of statistics 
(ho wevei^ rough and inaccurate these may have been), and their 
use for the purpose of calculating life contingencies; and 
although the Actuary has now to take account of a wider range 
of subjects than formerly, the collection and analysis of past 
experience and the employment of the results of such analysis 
to forecast the future is still his most important function. 

The title of the lectures is somewhat wider and more 
ambitious than the contents may be found to warrant. To 
justify it fully would involve dealing with many questions of 
detail relating to the collection and tabulation of data, such, 
for example, as the various methods for computing the 
numbers exposed to risk in a mortality experience, &c., 
which have been many times discussed in the volumes 
of the Journal of the Institute of Actuaries and many of 
which are exhaustively dealt with by Mr. Ackland in the 
recently published Account of Principles and Methods/’ It 
is evident that to deal with the subject in such detail, would 
outrun the limits of the six lectures which I have undertaken 
to deliver. I propose, therefore, to confine myself mainly to 
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a consideration of the general principles inYolved in the 
collection of statistical data, and in the construction frem 
such data of tables, of which the Mortality Table is the best 
kno^Yii and the most important, embodying the results in tlie 
form required by the Actuary, and, at the same time, to give 
such examples of the application of these principles as may 
be necessary to illustrate the subject. 

In this opening lecture in particular, I shall ask your 
indulgence if occasionally my remarks appear to be of an 
elementary character, as I think it desirable that we should 
be perfectly clear as to first principles before going on to 
more detailed consideration of the subject. 

Statistical tables, in one form or another, are familiar to 
all of us. At the basis of all such tables, and, indeed, of the 
whole science of statistics, lies one of the most fundamental 
facts in nature, namely, that all phenomena of which we 
have any knowledge fall into certain classes, groups or series, 
and cluster round certain types. But for this fact we should 
be unable to classify our knowledge, indeed, should never 
have acquired any to classify. Speaking broadly, then, every 
object and every event that comes within our observation is 
one of a group or class of similar but not identical objects or 
events, which, as a class, is marked off by certain special 
features from every other class, although the dividing line 
may not always be sharply drawn. These groups or classes 
are not arbitrary, but are inherent in the nature of things, 
although it is true that the particular groups which we employ 
in classifying our knowledge are chosen with a view to our 
own convenience and to the limitations of our minds. 

From a consideration of a class of objects as a whole, we 
get a conception of an average, or type,* to which each 
individual in the class more or less conforms, but from 
which, notwithstanding, every individual also diverges. Such 
divergencies or variations of individuals from the average 
type may be discontinuous, themselves running into types, or 
they may be continuous. Among the individuals forming 
together the type mankind, are divergencies such as those 
due to sex, race, nationality, birthplace, occupation, civil 
condition, &c., discontinuous variations producing sub- 
groups, the boundaries of which overlap and interlace, each 

* The type of the class should preferably be considered as represented by the 
“ mods ” or case of most frequent occurrence rather than by the " average ” or 
“ mean ”, hut this point is not here of importance. 



of these smaller groups again being capable of endless 
subdivision. These diverg^encies can be dealt with statistically 
only by counting the members of the various sub-groups. 

On the other hand^ there are divergencies^ which we may 
term continuous, such as those due to differences of age, 
height, weight, income, &c., &c., differing from the former 
class in that they do not involve the separation of the main 
group into sub-groups, but relate to qualities, possessed by 
each member of the group in varying degree, capable of 
measurement and numerical statement, and involving the idea 
in each instance of an average. Thus we can speak of the 
average age, height, or income of a group of persons, not of 
their average occupation or nationality, although we may 
speak of the average constitution of the group in respect of 
these latter qualities. 

A statistical table deals with some natural group of 
objects or events and is a numerical statement of the manner 
in which the members of the particular group differ inter se in 
respect of some special character or characters. If dealing 
with discontinuous variations, as for example a table showing 
the occupations of a group of persons, it will exhibit, implicitly 
or explicitly, the ratio of the magnitude of each sub-group to 
the whote, at a given moment or moments or on an average of 
a given period ; or it may take the form of a statement of the 
extent to which variations in one respect are affected by 
variations in another, as, for example, a table showing the 
proportion of the sexes in different nationalities. If dealing 
'with continuous variations, it will either represent a series of 
measurements of some quality common to members of the group, 
showing its average value for the group, and the manner in 
which individual values are grouped round such average, or it 
may represent, numerically, the manner in which deviations 
from the average in respect of some one quality A are corre- 
lated with the deviations in respect of some other quality B. 

It is mainly with the class of statistical table deahng '^vith 
continuous variations that the Actuary has to deal ; variations 
in the ages of lives under obseiwation, their ages, or the 
periods elapsed since entry, at death, withdrawal, marriage, 
superannuation, &c. In such tables the grouping of individual 
measures round the average will, in general, but noli always, 
be found to follow, approximately, certain well-defined laws. 
Taking first the tables dealing with a single variable, the 
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folio-wing may be considered as an example. It is a 
statement of tlie heights of 2,192 school children, and is 
abridged from that given in a paper by Prof. Karl Pearson. 

Table I. 


Showing heights of 2,192 School Children, aged 12 years. 


Heights in 
Centimetres 

No. of Children 
Observed 

Computed Nos. 
by Curve 
iCe-x'^lc^ 

Computed- 

-Observed j 

+ 

— i 

(1) 

(2) 

(3) 

(4) 

(.S) 

139-140 

1 



1 

135-138 

6 

3 


3 

131-134 

31 

25 


6 

127-130 

107 

119 

12 

... 

123-126 

321 

338 

17 


i 119-122 

585 

577 


8 

115-118 

618 

596 

... 

22 

111-114 

359 

365 

6 


107-110 

126 

135 

9 


103-106 

35 

30 


5 

99-102 

3 

4 

1 


Total 

i 

1 2,192 

2,192 

45 

45 


Note. — I n the fonnnla (col. 3) x represents the deviation in ceutunotres 

2192 

from the average; c = 7*76 and k has such a value — jr- as to iri(p,lve the area 

c w 


of the graduated curve equal to the ungraduated ; that is, to make the totals of 
columns (2) and (3) equal. 


•If we consider tlie progression of tlie numbers in 
column (2), we stall see that they form a roughly symmetrical 
series^ being largest in the neighbourhood of the average 
height and diminishing gradually on either side. It will be 
seen that the average height is about 1 ] 8 the number 
exceeding this height being approximately equal .to the 
number falling short of it. In order to bring out the 
approximate law of the series, I have inserted in column (3) 
the computed numbers on the assumption that the frequency 
of a deviation of centimetres from the average is 

represented by the function where c has the value 7*76 

2192 

and K the value — 7^- The expression represents 

TT 

what is usually termed the curve of facility of error 
or the '^^normaP^ curve of frequency. It will be seen that 
while the figures in column (2), are as we should expect 
them to be with such limited data, somewhat irregular, they 
conform on the whole fairly closely to the normal curve. 



5 


The normal curve was first used to represent the dis- 
tribution as to magnitude of errors of observation in physical 
measurements. It must not be regarded as representing a law * 

of Nature, but rather an extremely convenient and often very 
close approximation to observation ; experience proving that 
in many cases errors of observation and the deviations of 
individuals from the mean of a class do follow very closely 
the law referred to. The formula is therefore empirical and 
not to be established by a friori reasoning ; at the same time 
we may, perhaps, see a logical basis in the following 
consideration. We may suppose that, in any individual 
measurement, the deviation from the mean of the class (as 
the difference in the height of any individual among the 
2,192 in Table I from the average height of the whole 
group) is the result of an infinity of minute causes as to 
whose nature we are in ignorance, any one of which may 
produce a minute positive or negative deviation from the 
average. These minute superimposed deviations being 
indefinitely small and indefinitely numerous, we may without 
loss of generality assume them of equal magnitude. It is 
then clear that the magnitude and sign of the total resulting 
deviation in any given case will depend upon the extent to 
which th5 number of these minute positive deviations exceed 
the negative, or vice versa. 

If the number of possible causes of deviation is 2n, 
and if the extent of each indefinitely small deviation is h 
{n being indefinitely large, but kVn finite), then the,- 
probability or ^^frequency'^ of a total deviation lying between^''^^^^^ 

X and x + h will depend on our having ^ 


values of h and ( 72 ^- 


2kJ 


negative values. The probability 


of this occurring will be represented by the appropriate term 
in the expansion of the binomial + or 

X 


72, -f 


Ik 


It may easily be shown that 
indefinitely great, takes the form 


this expression, n being 


1 


\/ TT/i 


, i,e. (Constant) x e ' 
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i.e,) of the curve of the facility of error I do not propose to 
discuss at any length the properties of this particular curve, J 
but you will notice that the cuiwe being symmetrical with 
respect to positive and negative values of a?, it assumes 
that positive and negative deviations of a given magnitude 
are equally frequent, the average magnitude of such devia- 
tions being small or large as c is small or large. The maximum 
ordinate corresponds to the value of aj=0, which is the 
average value of (jj ; it therefore passes through the centre 
of gravity of the area enclosed by the curve and the axis 
of X, and also divides that area into two equal parts. It 
assumes that indefinitely large deviations are possible, hence 
it cannot be rigidly exact, because when dealing with physical 
measurements of any kind, indefinitely large errors are not 
possible. This is not a practical objection to the use of the 
formula, however, as the probability thereunder of deviations 
of many times the average value is extremely small. 

The following table, showing the number of entrants in 
various aged groups in the 0^ Experience, exhibits a quite 
different distribution of the deviations from the average : 


Table II. 


Nimher of entrants in quinary age groups dc^.a. 


Central Age 
j . of Group 

X 

Actual Entrants 
ill Group* 

Computed Xo. 
by Eormulat 

Computec 

+ 

1— Actual 

(1) 

(2) 

(8) 

(4) 


20 

431 

436 

5 


25 

1,273 

1,305 

32 


i 30 

< 1,526 

1,473 


53 

j 35 

1,269 

1,265 


4 

! 40 

914 

930 

16 


1 45 

591 

604 

13 


! 50 

354 

349 


*5 

1 55 

182 

178 


4 

' 60 

83 

79 


4 

; 65 

26 

29 

’3 


i 10 

7 

8 

1 


' 75 

1 

1 



Totals 

6,657 

6,657 

70 

70 


* huiiclreds. 

t Formula representing* number of entrants at js:iven ase = 

(88-48-^)6-oo 4; where log /c= -9*2360. ^ ^ ^ 

The student may copult Woolhouse’s paper on “The Philosophy of 
Sta^tistics’ vol. xvii, p. 37), or an exhaustive analysis of the properties 

of the curve by Mr. Sheppard {mU. Trans., vol. 192, p. 101) ; also “ Bowlev^s 
Elements of Statistics Part II, Sec. II -v ^ ^ » 



Here tlie numbers also exbibit a well-marked law 
goverPxing tke deviations from tbe mean^ but tliis law is no 
longer tlie same as tbat shown by the ^^normaP^ curve of 
frequency. The maximum ordinate does not coincide 
either with the average age or with the central age 
of the series; while the number of cases exceeding the 
average age no longer equals the number falling short of it. 
In other words^ the curve is non-symmetrical or skew. It 
follows very approximately^ however, a certain law, as will 
be seen by comparing the numbers in column (2) with 
those in column (3), which represent the computed numbers 
according to the formula stated. 

Having regard to the fact that the numbers in column (2) 
represent lOO^s and not units, the differences between the 
actual and computed numbers are somewhat outside the 
probable errors of observation. There are, that is to say, 
systematic differences between the two curves. These 
systematic differences are generally to be expected in dealing 
with age statistics. It vrill be seen that they are not 
incompatible with a close agreement in the general features 
of the two curves, but they serve as a warning that, in 
statistics of this nature, formulae representing the 
distribu^ion of deviations from the mean must be regarded 
as approximations only. 

If w^e consider the curves exhibited in Tables I and II we 
see that the general character of such curves is determined 
by a few salient features : 

1. The position of the maximum ordinate; that is, the 

value of the variable having maximum frequency. 
This value is termed the mode, 

2. The average or mean value of the variable, being the 

arithmetical mean of all individual values. In a 
symmetrical curve this coincides with the mode.^^ 

3. The average deviation from the mean, corresponding 

to the closeness with which the individual measures 
are grouped round their mean value. There is a 
certain convenience, for analytical reasons, in 
adopting as our standard in this respect either the 
mean of the squares of the individual deviations, or 
the square root of this quantity. The latter is 
termed the standard deviation. We may represent 
the average of the squares of the deviations, or the 
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“mean square” deviation by the^symbol ih, wben 
the standard deviation becomes 

4, The equality or otherwise of the positive and negative 
deviations from the mean ■, that is, the sijmmetry or 
skewness of the curve. The sum of the first powers 
of the deviations is, of course, always zero. If the 
curve is symmetrical, the sum of any odd power of the 
deviations must be zero, but not otherwise. As we 
have employed the square root of the average 
square of the deviations as a measure of the 
diffuseness or spread of the curve, termed the 
“standard deviatioir”, so we may take the ratio of 
the cube root of the average cube deviation to the 
“ standard deviation ” as the standai-d of 

“skewness.” If we represent the average cube 
deviation by the symbol /xj, the skewness of the curve 

jy ytAq 

may then be measured by . 

The skewness is sometimes taken as the difference between 
the “mean^^ and the ^^mode^^, divided by the standard 
deviation. 

The sums of the successive powers of the deviations 
of the variable from the mean, the area of cuiw^ being 
taken as unity^ are termed the moments of the curve. 

These observed laws of the variation of measurements 
from their mean are very general^ and are usually, though not 
invariably, associated with what is termed homogeneous 
data. The distinction between ^niomogeneous and ^^letero- 
geneous^^ data is of considerable importance, although not 
very easy to define. "We may perhaps define a homogeneous 
group as one in which the continuous variations aro from a 
single type only, and are unaffected by any discontinuous 
variations in the group if these exist. Tlaese conditions will 
hardly ever prevail, but a group may be considered for practical 


purposes as homogeneous if the variations in the particular 
quality dealt with are not materially affected by any discontinoiis 
variations existing in the group. If, however, the group can 
be split up into two or three distinct series differing markedly 
in certain qualities, and these differences are found, or may 
reasonably*^ be supposed, to affect the character under 
examination, then the series is heterogeneous/^ 

Take, for example, the class representing assured lives of 



a given age, but of varying duration of assurance, and assume 
■we are investigating tlie rate of mortality of the class. If it is 
found on examination that the duration of assurance materially 
a:ffects the rate of mortality, then the data treated as a whole 
is heterogeneous. If it is found, however, that the duration 
of assurance after reaching a certain point has no such 
influence, or an influence that is insignificant, then the data 
from this point and in this respect may be treated as 
homogeneous. The same considerations apply to distinctions 
in class of assurance, amount of policy, occupation, &c. 

The laws which appear to govern deviation from the 
average in homogeneous data are, in general, so uniform in 
action that a departure therefrom wdll frequently indicate 
that data which might be supposed to be homogeneous are not 
so. An interesting* illustration of this may be seen in the 
case of the Male Annuitants in the New Offices^ Annuity 
Experience. Consider the following table showing the number 
of entrants for various gi'oups of ages : — 


Table III. 

Male Axxxtitaxts Bata. 

^ Nurnhe.r of ^entrants 4it various a^es, 1863-1S93. 


Ages 
at Entry 

Entrants 

Computed 

Numbers 

Observed 

— Computed 

1S63-1S93 

II 


j 



+ 

- 

(1) 

(•2) 

(3) 

0) 

(5) 

33-37 

73 

5 

68 


38-42 

119 

21 

98 


43-47 

207 

89 

118 


48-52 

421 

266 

155 


53-57 

599 

587 

12 


58-62 

957 

954 

3 


63-67 

1,147 

1,142 

5 


68-72 

982 

1,007 


25 

73-77 , 

660 

655 

5 


\ 

'r‘a-82 

252 

313 


61 

83-87 

72 

109 


37 

88-92 

15 

29 


14 

93-98 

1 

6 







'm 


These particular age groups are selected as there appears 
to be a slight excess in the number of entrants at decennial 
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and quinquennial ages^ and loj placing these in tlio middle 
of the groups we get rid of the disturhance, which would 
otherwise affect the numbers. 

An examination of the numbers in column (2)^ between 
ages 53 and 78, shows that they form a nearly symmetrical 
curve, as is seen by a comparison with a normal curve of 
frequency given in column (3).* The numbers above age 78, 
however, are in defect, and those below 53 are considerably in 
excess of the figures suggested by the normal curve. As 
regards the falling-off of the numbers at the older ages, it 
maybe conjectured that it is in part due to the fact that many 
published tables of the cost of annuities cease at age 7o or 
80. The observed excess in the number of entrants at ages 
below 50 evidently represents the entrance at these ages of a 
class of lives differing from those forming the bulk of the 
data. It may perhaps be conjectured that a number of these 
cases are counter lives in contingent reversions, or similar 
securities, upon whose lives annuities have been purchased to 
secure the payment of annual premiums. Be that as it may, 
we find that while the deficiency of entrants at the older 
ages does not appear to affect the mortality rates, the entrants 
at the younger ages on the contrary show abnormally heavy 
mortality, the ungraduated values of the expectation*" of life 
for entrants under age 55 being relatively low. Hence we 
may calculate that the male annuitant experience is hetero- 
geneous, and in using the results as a basis of calculation 
for the future, the abnormal part of the experience representing 
the entrants at the younger ages was properly rejected. 

In addition to tables of the kind we have been considering, 
a statistical table may be a numerical statement of the 
manner in w^hich variation in one particular from the average 
of the group is accompanied by variation in some other 
particular. We may, for instance, have a table representing 
a number of individuals, arranged according to height, the 
numbers at each height being further arranged according to 
weight. We should then have a table of double entry, each 
row or column of which would represent a statistical table of 
the form already considered. By means of this table we should 
be able to correlate as it is termed, variations in respect to 

The constants of this cnn^e were only roughly determined, Ijut the 
agreement with the ohseiwed numbers between ages 53 and 78 is sufficiently 
close to illustrate the point under discussion. 
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weight with variations in respect to height. Such a table 
would represent a mass of figures, the bearing of which could 
not easily be grasped without some further analysis. If, 
however, we add to the table a column showing the average 
weight for persons of a given height, we then have a ready 
means of seeing how this average weight is affected by a 
change in height. Having inserted the average, we have not 
exhausted the information which the original figures give us. 
We need also to know to what extent on the average the 
weight varies when the height remains constant ; that is, we 
need to insert against each average weight what we have 
termed the standard deviation.^’ 

A familiar example of such a table is one showing the 
ages of husbands and wives at marriage. Such a table would 
take the following form — 

Table IY. 


Slwioing Ages of Hush ands and Wives at date of Marriage, 













Wives’ Age 

s 



nusbaiuls’ 






— 

— 


under 

20^-30 

30-40 

1 

o 

50-60 

60-70 

Mean Ages 
of 

m 

20 






Wives 

uiuler 20 

13 

5 





17-8 

20'30 

215 

500 

16 

1 



22*3 

30-10 

14 

107 

39 

4 



27*0 

<10-50 

1 

14 

23 

12 

2 


35*0 

50-60 


2 

6 

9 

4 


42*1 

60 70 



1 

3 

4 

2 

52*0 

70-80 



... 

1 

1 

1 

65*0 

Mean 








Ages of 

25-1 

27*2 

37-6 

49-0 

58*6 

68*3 

— 

Hnsbiiiuls 









If there were no correlation between the ages of the 
husbands and wives at marriage, the figures showing the 
average ages for the various columns would (except for 
accidental iiuctuations) be identical, and the same would hold 
for the average ages of the successive rows. 

If a lino were drawn through the table cutting those 
points in the tows corresponding to the average ages, and 
another line similarly cutting those points in the columns 
representing average ages, it would be found that these 
points could roughly be represented by straight lines, which 
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in the pre^^ent example would be nearly coincident, since the 
spread of the figures, as measured by their standard deviation, 
is very similar in both rows and columns. 

It is not always the case,, however, that the nature of the 
correlation can be represented by a straight line. In the 
following example we have a somewhat different class of 
table showing the proportions for different age groups of 
wives and widows in an Indian pension fund. 

Table IVa. 


SJioioing proportion of TFtves and TVidoios in a JPension Fund, 


Ages 

Number of 
Wives 

Number of 
Widows 

' 

Total 

Widows, 
per-cent of 
Total 

under 20 

19 

... 

19 

0-0 

20-30 

1,430 
i 3,366 

50 

1,480 

3-4 

30-40 1 

355 

3,721 

9*5 

40-50 

3,329 

1,018 

4,347 

23-4 

50-60 

1,653 1 

1,312 

2,965 

44*2 

60-70 

476 

933 

1,409 

66-2 

70-80 i 

63 

330 

1 393 

84-0 

80-90 

i 

6 

1 46 

52 

88*5 


Here it will be seen, from the run of the figures in the last 
column, that they cannot be well represented by a ^raight 

line, being somewhat in the form of the curve of J e ’ * 

Qj^ 

or of the curve with values of 0 and 1 respectively at 

m -f 

the limits. 

Such a table of correlation has an analogy with the table 
of the “Exposed to Risk^^ and “Died^^, which ordinarily 
forms the basis of our Mortality Tables. This table is 
virtually in the following form — column (4) representing the 
number of annual survivors being usually omitted as being 
implicitly contained in columns (2) and (3) — 

Table of Exposed to Rish and Died. 


Age 

Exposed to Risk 

Died 

(b 

(2) . 

(3) 
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We liave here the ages of the persons observed ;* the 
numbers under observation, or Exposed to Eisk which, 
for the sake of simplicity, we will suppose to remain under 
observation for the entire year of age; the number of those who 
die during the year, and of those surviving. If we represent 
the rate of mortality by q^: then in all cases in column (3) 
= andin all cases in column (4) q^—O, and we have a 
table which is analogous to the table of the weights of 
individuals of respective heights, only that instead of having 
various values of we have in the nature of things only 
two possible values 0 and 1, the average value for each group 
representing the observed ^“^rate of mortality/^ This table 
differs from that correlating weights and heights, or ages of 
husbands or wives at marriage, agreeing with that correlating 
age and civil condition, in the fact that a certain quality 
or characteristic, in this case death during a given year 
of age, is not present in varying proportions, but is 
either present or entirely absent. We are thus introduced 
to the conception of probability, the proportion of any 
gi'oup surviving or dying representing the probability 
, of survival or death for any individual of the group taken 
at random. Tlie idea of probability is also j^resent in 
the sii!5)posed table of weights, although not so obviously. 
That table would inform us, for example, of the probability 
of a person of given height exceeding or falling short of 
a certain fixed standard weight, and we should then have 
a table identical in form with the table of Exposed to 
Risk and Died. 

This conception of ^probability is important to the Actuary, 
because his object in collecting statistics is the distinctly 
practical one of measuring the probability of the happening 
of certain contingencies. It is necessary to realise clearly 
what is meant by the statement that the probability of a 
particular event has this or that value. Laplace pointed 
out that wlien wo speak of the probability of the happening 
of a given event, wo do so only on account of oxxr ignorance 
of tlio antecedents of the event, or our inability to completely 
analyze them. If wo entirely knew the antecedents, and if 
our powers of analysis were ecpxal to the task, we could 
predict tlio event. In many cases we are abte to do this 
approximately, but where the effective causes at work 
are numerous and obscure, and the result in individual 
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fap|)areiitly similar) cases is Yery yariable^ as in all questions 
affecting life contingencies, ye are unable to forecast tbe 
event in a given case, and must fall back upon the average 
result deduced from the examination of a large number of 
similar cases. In other words, we treat the particular case in 
question as one of an indefinitely large class of similar cases, 
a sample of which we have already had under examination. 
From the results of such examination we infer the composition 
of the class as a whole, and hence the probability or 
average event in an individual case. If, in the sample 
observed, a given character is present in a certain proportion 
of cases, as for instance, where out of a number of persons of 
given age under observation, a certain proportion have died 
within the year of age, then we estimate the probability of 
the event happening in a particular instance, by the ratio 
which the number of cases in which the event has occurred 
bears to the entire number of cases observed.* To determine 
the probability of a given event is therefore to assign the 
case to the natural group or series to which it properly 
belongs and to pass under examination a sample of the group 
sufficiently large to enable us to determine approximately the 
^average character of the yffiole as regards the particular 
quality in question. We are here speaking of simple#e vents ,* 
the probability of a complex event, such as the survival of 
•one life by another, is, of course, not determined directly by 
past observations. The latter yield the simple probabilities 
■of surviving each year of age, by suitably combining which 
we arrive at the value of the probability desired. 

The degree of certainty with which we can deduce the 
properties of an entire class from the part known to us, 
depends first on our assurance that the class is homogeneous, 
or at least that the portion observed is representative, such 
as would result from a selection of cases made at random, and 
secondly, on the number of cases that have been under 

"• The formula deduced bj Laplace by -wliicii tbe true probability of an 
erent wiiicb has been observed to happen m times out of m -f « trials is taken as 

obviously not applicable to such a function as the rate of mortality, 

nor to any analogous function. It is snf&cient to consider that in tabulating the 
values of the probability of dying in each year of age, we are using an arbitrary 
unit of time which might just as well be a month or day, in which cases we should, 
by use of the above formulae, produce quite different mortality tables from the 
same data. 
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observation. If ^ve examine tlie figures in tables similar 
to Tables I and we see that, in proportion as the number 
of cases under observation is small, the figures representing 
the results of the experience are irregular, while, on the other 
hand, where the number of facts observed is very large, the 
irregularities become relatively less. We arrive at the same 
conclusion from theory. If an indefinitely large group Is 
contains Up objects of class A and 1^(1— 2 ^) objects not of 
class A, and if from the group n objects are selected at 
randohi, then on the average np of these will be of class A. 
If we represent the observed number in any given case as 
the average algebraical value of s will be zero, 
while its average numerical value, irrespective of sign, will 
be very nearly This latter quantity clearly 

increases as np increases, but at the same time its ratio to 
np diminishes. Thus in a table of exposed to risk and died 
the actual irregularities in the number of deaths increase 
with the magnitude of the experience, but the irregularities 
in the rate of mortality diminish. Hence from theory as from 
experience we derive the conviction that if instead of the 
limited number of facts which we have been able to examine, 
we could have examined an indefinitely large number of 
similar^ facts, the results would have been relatively free 
from irregularity, and capable of being expressed by a 
continuous curve ; without, of course, being sure that any 
such curve could be expressed algebraically. 

The idea underlying the graduation of the figures of a 
statistical table, whatever be the process employed, is that a 
continuous curve may be found representing the general trend 
of the observations freed from irregularities due to paucity 
of material. This curve, W’e have reason to believe, will 
correspond more closely than the ungraduated curve to the 
results obtainable from a much larger body of facts. This is 
the rationale of the process of graduation and its justification. 
Such a process cannot deal with systematic errors affecting 
the table as a whole and cannot compensate for inadequate 
data. It adds wreight to the results, however, at each 
individual point of the table, and assists in bringing into 
relief the true character of the curve by freeing it, in a 
large measure, from accidental irregularities. ^ 

^ The average value of s" ■will be mpq, the average value of will be 
np 2 (p- 2 ), and the average value of willhe7i3)g[(3n--6)pg-rl]- iS'eebTote A,p.llO. 
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‘There may be other objects aimed at in a graduation 
besides that of removing the irregularities from the roiigli 
figures^ with the view of bringing out more clearly the law 
underlying them. The Actuary constructs tables not merely 
to show what has happened in the past^ but to enable him to 
forecast the future^ and as he requires these tables as a basis 
for financial operations, considerations are introduced which 
do not arise in the treatment of purely statistical tables. 
Whatever class of events the Actuary may have to deal with, 
will be subject to change with the lapse of time. That 
portion of the class he has been able to observe lies 
necessarily in the past ; the conclusions he has derived from 
their study he proposes to extend to the future. He must 
therefore consider how far the observed characters of the 
class are changing or permanent, and must endeavour 
to distinguish between changes representing permanent 
tendencies and those due merely to temporary fluctuations. 
In the selection of data suitable for his purpose the Actuary 
will aim on the one hand at a sufficiently broad basis both in 
space and time to eliminate the effects of local and temporary 
fluctuations, and on the other hand he will aim at obtaining 
as far as possible a homogeneous group of data. These two 
aims are more or less in conflict, and he will lean to«the one 
side or the other, according to the object he has in view. 
Where, for example, that object is to produce a table that 
may be adopted as a general standard by various institutions, 
often differing considerably as to their individual experience, 
he must aim at a correspondingly broad foundation. In 
these circumstances it will not generally be possible to o];)tain 
a really homogeneous experience. If it is a question of the 
mortality of assured lives, for instance, this will be found to 
be affected by endless individual variations, age, sex, duration 
of assurance, occupation, civil condition, class of assurance, 
character of the insuring office, &c., &c., and from siu^h 
material approximately homogeneous data could only be 
obtained by cutting up the experience into comparatively 
small groups and thus sacrificing all generality. This (‘.a,n l)o 
avoided in practice by first excluding all extreme variations. 
The sexes will be separately treated, lives so impaired as to 
prospects ^ longevity by personal health, family Ihstory, 
occupation, or residence in unhealthy districts as to be '' rated 
up will be excluded, as also classes of assurance that may 
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be supposed. subject to rates of mortality differing from the 
average. When the data has thus been trimmed of the 
extreme variations^ a .body of experience will generally 
remain not greatly shrunken from its original dimensions 
and in which the discontinuous variations are sufficiently 
numerous and individually unimportant to render the data 
for practical purposes homogeneous. The rates of mortality, 
or of withdrawal, can then be treated as functions of the two 
remaining variables of importance, the age and the time 
elapsed from date of entry ; or as functions of the age only 
from the point at which the factor of duration may be found 
to be unimportant. 

On the other hand, the Actuary's object may be precision 
rather than generality ; he may have to deal with a group, 
subject to special conditions and presenting special 
characteristics, as is usual in the case of pension funds and 
friendly societies. Here, if the data are at all adequate, better 
results will be obtained therefrom than by having recourse to 
any general experience. Wliere it is insufficient by itself as 
a basis for statistical tables it may serve as an indication as 
to what standard table is the most suitable to employ and as 
to how far and in what direction it may be desirable to 
introduces any modifications therein. In an experience of 
this character the data may sometimes be very heterogeneous, 
but there is usually the safeguard that its composition is 
approximately constant. 

A question of some importance may here be considered, 
namely, the relative claims of lives, policies, or amounts 
assured to form the basis of the mortality table. In the 
17 Offices^ data, the number of policies, in the and 0^^ 
data, the number of lives passing under observation 
constitute the basis of the experience, while in the American 
Offices^ Experience (1880) the sum assured was the unit. In 
the instances of the and 0^^ Tables, wherever a life would 
have been doubly observed the duplicate assurance was 
eliminated. In justification of the use of the sums assured 
.as the basis of the experience, in lieu of the number of lives, 
it may be said that in this way we represent the financial 
effect of the mortality, as it makes, no difference to the 
insuring company whether one claim arises for M 0,000 or 
one hundred claims for £100 each. There are, however, serious 
objections to employing the sums assured as a basis for a 
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mortality table^ based upon a general experience. Either the 
mortality among the lives carrying large sums assured is 
similar to the average or it is not. If it is similar, the 
general character of the table will not be affected by the 
additional weight given to these lives in the experience, but 
the irregularities in the deduced rates of mortality will be 
eonsiderably increased. The result, indeed, will be virtually 
the same as if we had used a part only of the 
available data, selected at random, instead of the 
whole. If, on the other hand, the mortality among the 
lives insured for large sums is materially different from 
the average, then the experience is not homogeneous. 
As a matter of fact, these lives of themselves do not form a 
homogeneous group. In certain societies they appear to give 
better rates of mortality than the average; in others, where 
they are mainly represented by non-profit policies effected for 
commercial reasons, they are no doubt subject to higher rates 
of mortality than the average. As in a general experience, 
combining the individual experience of many oifices, these 
lives will represent an exceptional or abnormal element, 
which may or may not persist in the future, and will certainly 
not persist equally in all societies, it is not desirable in 
deducing a general mortality table to specially wgight up 
this part of the data. 

The same considerations apply, but with somewhat 
less force, to the plan of making policies rather than 
lives the basis of an experience. Without dogmatizing 
upon the point, it appears to me that the proper course is, 
where two or more policies are effected at the same time or 
at the same age at entry, to treat them as a single risk, 
but where the subsequent policies are effected at later ages, 
involving fresh medical selection, to treat them as separate 
risks. This means the elimination of duplicates in each of 
the select tables for individual ages at entry, but no 
further elimination in the resulting aggregate tables, a course 
which has the advantage of making the aggregate table the 
true aggregate of the tables for separate ages at entry. 
Judging by the results of the 0^^ experience, this course is 
necessary if we are to produce an aggregate table, 
representing '"ultimate'’^ rates of mortality after the lapse of 
a stated period from entry, which will join on smoothly to 
the select rates * 
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A detail of less importance^ but of considerable interest, 
is the question of the proper treatment of withdrawals in 
a mortality experience. These are usually treated as 
withdrawing upon the termination of the days of grace in 
case of lapse by non-payment of premium, and for the 
purpose of obtaining the true measure of the mortality 
experienced this course is the correct one. It should be 
borne in mind, however, that to arrive at the financial effect 
of the mortality the numbers of the exposed to risk should 
correspond to the number of annual premiums paid, and from 
this point of view the life withdrawing should not be treated 
at risk dmung the days of grace. The differences in the 
resulting mortality rates according to the two methods is, of 
course, very slight. 


♦ 



SECOND LECTURE. 


Having dealt ill tlie last lecture with the oxitioiiale 
of graduation in general, I now propose to refer more 
particularly to the principles underlying certain special 
methods of graduation. We may divide the various metliods 
which are in use into three classes : 

1. Graphic methods. 

2. Methods based upon Interpolation or Finite 

Difference formulae, such as Mr. Wool h ousels. 

3. Methods which depend upon the use ol: Frequency 

Curves, in which we may include all nietliods 
based upon the assumption that the series to be 
graduated can be represented as some function of 
the variable. 

Certain general considerations apply to all these methods. 
We may have to deal either with a single series of nn'nibers, 
such as the number, at successive ages, of lives effecting 
assurances, of persons enumerated at a census, or attacks 
from a given disease, &c. ; or, as more often happens in 
actuarial statistics, the fact of importance may be the ratio 
ietween the corresponding memhers of tioo series of numhers 
as in a table of Exposed to Risk^^ and ^^Died^^, forming the 
basis of the Mortality Table, where the fact sought is the rate 
of mortality at each age given by the ratio of th,e Died to 
the Exposed to Risk, the actual numbers of these being* 
of importance mainly as affording a measure of the 
trustworthiness of the deduced ratio. 

Where only a single series of numbers is involved, tlic 
problem is comparatively simple, and an accurate solution is 
not generally of great importance to the actuary. In the 
more usaaal case where the ratio of the corresponding* 
members of two series of numbers is in question, the problem 
is more complicated. We have a choice of procedure : we 
may either graduate independently the two series of numbers 
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(in the case supposed the numbers of the "^Exposed to 
Risk at each age and the numbers of the “ Died or^ 
disregarding the irregularities in the two series, \ve may 
proceed to deal at once with the ratios only. If each series 
can be satisfactorily graduated, the resulting curves being 
smooth and fitting the ungraduated series sufficiently closely — 
that is to say, within the limits of the errors of observation — 
we may then assume that the ratios of the corresponding 
terms (in the case supposed the rates of mortality) will also 
be within the limits of error. It may also be said that by 
working with the rough facts themselves, rather than the 
ratios between the two, we keep in view the weight of the 
observations at each point of the curve, and are able to see 
at once how far our graduated numbers vary from the 
original, and how far that variation is justified by the number 
of facts at each particular point. There are, however, some 
important objections to this course. In the first place, the 
ratio between the corresponding terms in the two series of 
numbers represents generally a relatively stable quantity, 
whereas the actual numbers in either series, depending as 
they do upon the extent of the experience under review at 
particular ages, are liable to fluctuations of a more or less 
arbitrary^bharacter. Further, supposing the graphic method 
of graduation or the method of finite differences is employed — 
in either case the argument is applicable, although specially 
so in the former — it will be found that each curve will 
contain certain outstanding irregularities, as it is not possible 
entirely to remove all irregularities by those methods. Hence 
in the adjusted ratios two sets of irregularities will be super- 
imposed and a less satisfactory series of values obtained than 
if the ratios themselves had been dealt with. 

A stronger objection, when dealing with a mortality 
experience, to graduating separately the numbers in the two 
series of Exposed to Risk'’' and ""Died" rather than their 
ratio, is that we thereby discard our previous knowledge of 
the nature of the curve expressing that ratio — our general 
knowledge, that is, of the nature of the curve or — 
knowledge which is of considerable assistance in graduating the 
commencement and end of the table where the data are few. 

Where a graduation of both series of numbers^s made, it 
is preferable, indeed necessary if the best results are to be 
obtained, after first graduating the series corresponding to 
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the “Exposed to Eisk'', to re-compute tlie numbers of deatlis, 
lapses or marriages, as the case may be, on the basis of the 
graduated numbers of the Exposures, and to operate upon 
these adjusted numbers. We are in this way less likely to 
obscure the law of the series representing the required ratios. 

Notwithstanding any theoretical objections, there may be 
occasions on which it is more convenient, or even necessary, 
to deal with the two series separately ; whore, for example, 
as in the Registrar-Generars returns of the population and 
deaths for certain occupations, we have not the facts for 
individual ages, but only in certain large groups. The 
ratio of deaths to exposures for each age group are obviously 
not satisfactory approximations to the rate of mortality for 
the central age of the group. In these circumstaaices it 
appears to be best to adopt a plan similar in pi*inciple, 
though not in detail, to that employed by Milne in. gra-dimting 
the Carlisle Table, and to draw curves respectively through 
the parallelograms representing the exposures jind tlu^ <loaths, 
and from these deduce the numbers for individual ages. d.die 
graphic method, however, is not very suitable for this purpose, 
and the use of interpolation formula) does not always givi^ 
good results. It is generally better to make use of suitable 
frequency curves. It will be seen later tluit, \'^ior(^ tlie 
number of groups is rather small, the use of the norinaJ 
frequency curve, with certain modifications, enables us to 
re-distribute the numbers representing the groups of 
Exposed and Died and so obtain graduated munbers 
for each age, and hence from the ratios of these a gradua,ted 
rate of mortality. {8ee the Sixth Lecture, p. 91.) 


We shall now assume that we are dealing, not with tlu^ 
two independent series, but with the ratio between the two ; 
as, for example, with or some aiialogous function. 

We may consider we have three independent estimates of 
the value of q ^ : — 

1st — That derived from the observed ratio of the 
died to the exposed at age x, 

2nd — That derived from the data at neighbouring 
^ ages. 

3rd — That derived from previous experience of more 
or less similar data. 
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The first and second should he suitably combined in the 
process of graduation. The last is, in the nature of things, 
a very vague estimate, and bears a relation to that derived 
directly from the observations, if these are numerous, similar 
to that of a rough measurement by inferior instrumental 
means to one made by an instrument of precision. In such 
case no weight attaches to it. 

There are circumstances, however, in which the a priori 
estimate of the values of become important, viz., when the 
observations at our disposal are extremely few. As the 
extent of our observations diminish, the numbers of exposures 
and deaths becoming smaller, the weight to be attached to 
the deduced values of the rate of mortality become less, and 
a point is eventually arrived at when we obtain more 
trustworthy results by considering to what particular class 
of examined data the experience most nearly conforms in 
character, and falling back upon the results of such related 
experience. 

If we have to deal with a large experience, a somewhat 
similar difficulty arises at the commencement and end of the 
table. Generally speaking, we then derive more trustworthy 
values for the rates at these ages from a consideration of the 
general ^rend of the curve and our previous approximate 
knowledge of its character, than by falling back upon any 
related experience. 


Coming to the principles underlying each of these three 
methods of graduation, we consider first the graphic method,, 
whether in the form employed by Milne or in the preferable 
form employed by Dr. Sprague. This method makes no- 
further assumption than that the series with which we 
are dealing would, if the observations were sufficiently 
extensive, foiun a continuous and regular curve, and that the 
irregularities actually occurring in the lingraduated valuer 
are due to the smallness of the data. 

To Dr. Spi'ague {J.I.A., vol. xxvi, p. 77) we owe the 
most systematic and satisfactory exposition of the graphic- 
method. An essential feature in his procedure is the 
preliminary division of the data (which we nfay suppose 
arranged by years of age) into groups, so selected as to afford 
a steady progression in the average rates of mortality for 
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successiv© groups^ du6 regard being had to the laiige of 
these groups. For examples of the method^, the student must 
be referred to Dr. Sprague^s original papers. This process of 
dividihg the data into selected groups appears at first sight to be 
arbitrary; but it may be justified on the grounds : — (1) That 
in a series of observations such as we are discussing, where 
at each age the results are affected by irregularities or errors 
of observation, a successful graduation will reduce the sum 
of these errors and also the sum of the "'accumulated '' errors 
to zero, or nearly so. Hence if we compute at each age the 
accumulated errors (reckoning from either end of the series) 
these must; in order thS their sum may be approximately 
zero, change sign, thus passing through zero, fairly frequently. 
The data will, therefore, be made up of consecutive groups, 
larger or smaller, in each of which there is an approximate 
balance of errors, and it maybe assumed that, with a suflicicnt 
amount of experience and the exercise of some trouble, these 
groups can be found by inspection and trial. (2) In further 
justification of this procedure, it is to be noted tliat the rates 
of mortality deduced from the average rates in the selected 
groups are used as a first approximation only, the final rates 
being arrived at by repeated comparison of the graduated 
deaths with the actual numbers until a sufficientl}^^ smooth 
curve and a sufficiently close agreement has been obtained. 
At the same time I am not convinced that the use of these 
specially selected groups has any real advantage over the use 
of groups of constant range, as quinquennial or decennial, 
provided the operator recognizes that he cannot look for an 
absolute balance of errors in these latter, but must regard 
them as equally subject to errors of observation with tlie 
numbers at individual ages. 

Assuming it to be practicable to draw a sufficiently 
smooth curve, free from sudden changes of curvature, and 
yet representing the observations sufficiently closely witli a 
due regard to their weight in different parts of the table, 
there would appear to be nothing to object to in the principle 
of -the graphic method of graduation. In practice, however, 
there are certain difficulties. The first, particularly in the 
case of a mortality table, is the question of scale. Anyone 
who has att’^pted to make graphic graduations will, I think, 
have met with this practical difficulty. Whether we graduate 
separately. the '"Exposed to Risk and "" Died or whether 
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we graduate a function sucli as the difficulty equally 
arises. The values of may range in practice from about 
*005 to^ say^ about *5, and at the older ages increase so 
rapidly that the eye does not readily grasp the nature of the 
curve. In order that it may do so, and that the curve may 
be drawn and read off with sufficient accuracy, a certain 
proportion must be maintained between the horizontal and 
the perpendicular scale, so that the curve shall not cut the 
ordinates at too acute an angle. It is also necessary to 
represent the values of in two or three sections, as the 
scale suitable to the older ages will not permit of the values 
at the younger ages being represented vnth sufficient 
accuracy. 

Instead of operating on the rates of mortality, we may 
with advantage employ the logarithms of the rates, or the 
logarithms of the central death rates.* We thus obtain a 
curve which is much more easily dealt with. From the fact 
that the rates of mortality change slowly at the younger 
ages, and at the older ages generally approximate to a 
geometrical progression, the logarithms of the rates are 
nearly in the form of an arithmetical progression, and are 
represented by a line having very little curvature. At the 
oldest ages, indeed, it may very conveniently be taken as a 
straight line. 

Perhaps the main difficulty in graphic graduation is that 
it is by no means easy, even with mechanical aids, to draw a 
sufficiently smooth curve. The curve as drawn may appear 
to be smooth, but on reading it off and examining the series 
of values obtained, we find irregularities which, in order to 
produce a satisfactory graduation, must be removed by a 
further adjustment. If we are dealing with a relatively 
small experience — in which cases these practical difficulties 
are correspondingly increased — they may be overcome to a 
large extent by using as a base line a well-graduated standard 
table representing an experience of similar character. By 
computing the expected deaths according to the standard 
table, and dealing with the ratio of the actual to the 
'' expected '' deaths in successive age groups, we avoid the 
difficulties due to inequality of scale and to the rapid increase 
in the value of the ordinates at the extreme ages.^ The curve 

;S'ee, however, Note B, j), 114, as to precautions in dealing with logs of rates 
of mortality and similar fimetions. 
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of ratios, apart from accidental fluctuations, will often be 
found to approximate to a straight line, tlie departures from 
which can be, of course, represented on a relatively large 
scale. In particular, the difficulty arising from the paucity 
of observations at either end of the table will be avoided by 
making each extremity of the curve of ratios terinin<‘ito in 
a straight line, the locus of which will depend upon the 
general trend of the curve in the neighbourhood. The 
resulting values at the extremes of the table obtained in 
this way will be more trustworthy than those obtained 
without the aid of the standard base line.'^" 


Ill finite difference or interpolation methods of graduation 
(of which we may take Woolhouse's as the best known typo) 
the underlying assumption is virtually the same as in the 
graphic method, viz., that the curve is of such a nature tliat 
the ordinary methods of interpolation can be applied. Ihit 
more precisely, Woolhouse's method assumes that for a ra.uge 
of 15 consecutive ages the values of can be reiirescnted 
with sufficient accuracy by a curve of the third order, ix., 
when t is not numerically >7. As this 
assumes the fourth and higher differences of 1^, to be^zero, we 
may write 

^ l25 ^ ^'x+r) 2(Z.r-6 4' + Ix+i) 

'l’7(Za?-3 + ^a?+3) +21(Z^_2 + ^;j;+2) + 24(Za-_l + Z.r+i) 4*25/.^.} 

where may be taken as the graduated value of that 
function, the quantities on the right-hand side of the equation 
being the ungraduated values. 

This formula, which is that used by Woolhouse in the 
graduation of the Table, is of course only one of 
numerous possible formula deducible from the above expression, 
for Others may be found resulting in a smoother 

graduated series, but all the formulae since proposed as 
improvements on his are based upon the same general 
principle. An indefinite number of such formulae can ho 
found, even when the range is fixed.t In particular may be 

See LidstonG, sxx, p. 212. Tlieso renmrks 0 / 1*6 eq^uiillv opplicublc 

to graduation by a finite difference formula (see JI.A., voL xli, p, 89). 

t See Todbunter, J.l.A., xxxii, 378 ; G. F. Hardy, e.7,I.A-/xxxii, 37L 
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mentioned Mr. J. A. Higham's, Dr. Karnp's, and that used 
by Mr. J. Spencer in the graduation of the Manchester 
Unity mortality experience. See the following table 
showing the value of in terms of the ungraduated u’s : — 

Table V. 


Showing the vahces of ivliere hy various well- 

known Graduation JFormulee. 


Distance 
from Central 
Term 
t 

Silencer 

21-term 

Formula 

Karui> 

Higham 

Woolhouse 

0 

•172 

'200 

*200 

•200 

± 1 

•163 

•182 

•192 

'192 

d= 

•135 

•139 

•144 

•168 

d= 3 

•095 

•085 

•080 

•056 

=b 4i 

•052 

•034 

'024 

'024 

=b 5 

•017 

•000 

•000 

'000 

± 6 

-•005 

-•013 i 

-•016 

-'016 

± 7 

-•015 

-•014 

-•016 

-•024 

± 8 

-•015 

-•010 

-•008 

•000 

± 0 

-•009 

-•003 

•000 


=blO 

-•003 

•000 



±11 

•000 





It is clear that no such formula will entirely remove 
the irregularities in the series, and in Woolhouse^s graduation 
of tlie ''.I^ible the outstanding irregulaidties were removed 
by an cm]:)iricaf process similar to that employed for the 
graduation of tlio 17 Offices’ Table, and described in his 
paper (J'.J.d,, vol. xii, p. 140-1). The object aimed at in 
a formula such, as these, should be so to select the coefficients 
oE the terms on the right hand that, while giving an 
expression for tlio value of the central function correct as far 
as the order of dilferencos employed, the formula will 
produce the maximum smoothness in the flow of the 
graduated values. This may bo done by simple experiment, 
or wo may adopt some empirical measure or standard of 
smoothness and thereby compute the most advantageous 
coefficients. We may, for example, adopt as our standard 
of smoothness the extent to which the second differences 
of our gra-duated function are affected by tife errors of 
observation in the original table. 

Applying this standard to Woolhouse’s formula, we have 


€ 
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for the graduated second central difference of (using 
central differences for the sake of symmetry)— 

125 A=r*_i = - + Zx-fi + 4--5 + L—i + 1 OZa-3 

— 11 lx-2 — 2 Ix-l — 22j,. 2^a,.^l 1 1 lx+-2 

4'10Zx+3+ + L*+5 4* + 4 /^. 4 ; — 8 ^'a-+8* 

If we assume that on the average each of the inig-raduated 
values of on the right-hand side of this equation is subject 
to a mean error of ±e, and if we assume that these errors 
may be combined according to the normal law^ then the mean 
error of the entire expression for will be found by 

multiplying e by the square root of the sum of tlie squares of 
the coefficients^ giving 

( y3‘-2+42+p+i2-pi2-j. &c.) _ y bio ^ 

125 125 

In the same Avay it may be shown tliat in Ka,rup^s formula 
the mean error in is about *068^; wln^re c is tlie 

mean error of a single value of u^. It must not l)e supposed 
from these results that the mean errors in the gra,dua.ted 
values of or iix a-re proportionately reduced. Tlio mean, 
errors in the graduated functions when Woolliouso^s formula 
is employed are reduced to about *42 of the mean errors in 
the ungraduated functions^ or are about e(|uiva,leut to the 
mean errors of the ungraduated values correspoiKling to a.n 
experience 5| times larger. The graduated tjil)le bjised on 
the smaller data would, however, be Hmoother tluiii the 
ungraduated table based upon the larger data. (See J.I.A,, 
xxxii, pp. 376-7.) 

Taking a generalized formula, vsuch as 

u'x=aUx + h{Ux-i+U^+,) + c{Ux_i + lCx+2) + <^C. . . H'l(x-/. + ‘l(x + l) 

where represents the graduated value of Ux, a,ud 
assuming that each of the ungraduated valiie.s iix, &c., 
are affected by the same mean error +e, it is of course 
possible to determine the values of a, h, c, &c., ,so that tho 
mean error fe, say, A^u'^-i shall be a minimum. Noting that 
a = 1 - 26 - 2c - &c., and that 6 + 4c + 9d + &c. = 0, in order that 
the foi-mnla may he correct to 3rd differences, an e.xpre.sHion 
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may be found for in terms of &c., with 

coefficients inYolving c, d, . . . it. If the coefficients of each 
term are now equated to zero^ there will be (2^-f3) equations 
of condition with (^ — 1) unknowns, which may be solved 
by the usual method of least squares. 

This is somewhat theoretical, however, as the values we 
should obtain for the coefficients would be generally 
fractional, and the resulting graduation formula would not 
lend itself to any continuous method of computation, as is 
the case with Woolhouse^’s and other similar formulae. An 
alternative would be to fix upon a convenient set of 
summations, and then to determine the function summed 
(called by Mr. Lidstone the operand so that (1) first and 
second differences may vanish — see xxxii, 371, &c.; 

(2) The range of the formula may be what we require; 
and (3) that subject to (1) and (2) the coefficients shall be 
such as to make the mean error in or A^ a minimum. 
This might give a fairly convenient working formula, as 
when once the operand was formed the ordinary convenient 
method of summation would apply. 

If we consider the effect of such a formula of graduation 
upon the outstanding or unbalanced errors of observation in 
a smal? group of ages, we shall see that they are not very 
materially diminished. If, for example, we express the sum 
of five consecutive graduated values in terms of the 
ungraduated values, we shall have, in the case of Woolhouse’s 
formula, 

I' x-2 + "h Z xA- 1 x+lA I ar-f2~ 

“h 115Z;r*4" lOl^x+ihSOZ^P^o) 

+ terms involving other values of L 

Here it is obvious that any systematic or unbalanced error in 
the original group will not be greatly reduced (probably 
to about three-fourths of its amount) in the graduated table. 
While, therefore, finite difference formulas of graduation I 
yield, generally, a smooth curve as regards the progression of 1 
the graduated values from age to age, they have a tendency 
to reproduce any waviness in the origina^ due to the 
unbalanced errors affecting small groups of four or five 
consecutive ages. 
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A question arises in connection Avitli tins method as to 
what particular function should he selected for graduation. 
In the case of Woolhouse’s original formula the function 
operated upon was Practically speaking, except for the 
latter portion of the table, this approximates in result to a 
graduation of the rates of mortality. This may he seen from 
the following relations. Any adjustment of the column hy 
a finite difference formula has, of course, the same effect a,s 
a similar graduation of the d^; column. Since ^wid 

since for the range of ages included in the formula (fifteen 
in Woolhouse's formula, of which, however, only the five 
central ages are heavily weighted) the values of are not 
in general widely different, the graduation of the Z.,. or 
column should give results not materially different from those 
obtained by graduating At the older ages, however, 

there may be significant differences in the results, and I must 
express my preference for the rate of mortality as tlie more 
suitable function to graduate if the observations are duly 
weighted or if proper precautions are taken to avoid 
anomalous results at either end of the table where da-ta a;re 
scanty. 

An objection to the principle of the finite difFercmce 
methods of graduation is that the weight of the observations 
is not allowed for at various ages. This objection is not voxy 
serious, however, as at the commencement and end of the 
table, where it would be chiefly felt, the method is usually not 
strictly applied. It may be noted that if the function be 
graduated, then its rapid decrease in value at the oldest ages 
in the table gives automatically a diminishing weight to thc‘. 
observations with increasing age, but at the same time yiedds 
somewhat irregnilar graduated values. The objection may, of 
course, be got rid of by first applying a smooth series of 
weights to the function to be graduated, prior to graduation, 
and eliminating these factors afterwards. 

A difficulty arises in the use of finite difference formula 
from the smallness of the data at the extremes of the table 
and from the fact that the first 7 or 8 values of the 
graduated function cannot be obtained from the formula. In 
the case of a mortality table there is not so much difficulty 
in dealing ^with extreme old age, because there, as 
Woolhouse points out, if we are dealing with the function Z^ it 
may he taken =0 beyond the limiting age of the table, or if 
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we are graduating the rate of mortality, may be put down 
as equal to unity. As regards the earlier ages, Woolhouse^s 
method is to obtain from the formula the graduated values 
of lx so far as this can be done, that is, to within 7 years of 
the initial age, and to compute the values for the first seven 
ages of the table from the values of Zo, Z 7 , Zs and Ig 
(Zo representing the value of for the initial age) on the 
assumption of a constant third difference. This method may 
in certain cases lead to anomalous results, even negative 
rates of mortality. Mr. Ackland has given an alternative 
method of considerable ingenuity (tJ.J.A., vol. xxiii,p. 357). The 
difficulty may be avoided by assuming values for the initial 
ages, as, for example, a constant average value of or d^y or 
other arbitrary values deducible from the general character 
of the experience. A more satisfactory method would be to 
determine q^ for the first 10 or 15 ages, by the method of 
moments or least squares, on the assumption that it could 
be represented by a first or second difference function. All 
these methods, hovrever, are expedients moz'e or less 
empirical, though they may in practice lead to sufficiently 
satisfactory results. 

The Finite Difference methods of graduation all 
assume J}hat the functions to be graduated may be repre- 
sented for successive small tracts of ages by a parabolic curve 
of the form — 

Ux=^a + hx-\rcx^+ &c. 

We are not bound to assume this particular form of 
function. We can employ the principle of the Interpolation 
method, representing our function by some other form, 
as, for example, corresponding to Makeham^s 

formula. 

The principle of the methods of graduation we have been 
discussing, of which Woolhouse’s is a type, must not be 
confounded with that used by Davies in graduating the 
Equitable experience, nor with that used by Mr. Berridge 
in graduating the Peerage mortality. These latter are more 
nearly allied to graduation by frequency curves than to 
Woolhouse’s method. In Davies^ Equitable graduation, 
curves of the third order are actually fitted ^ successive 
sections of the h column, the values of from 10 to 40 being 
virtually found by a third difference interpolation, from the 
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values Zio, Z20. ^30, ko, those from ho to Z70 similarly from the 
values of ho> ko, k,, ho, and so on. Mr. Berridge^s graduation 
of the Peerage mortality followed a similar principle, except 
that he represented the entire series of values of log h: from 
15 to 75 by means of a; single curve of the sixth order, based 
upon the values of that function for decennial intervals of age. 

As to the relative merits of graphic and finite difference 
methods of graduation, the former has an undoubted advantage 
when the number of facts at our disposal are few. In these 
cases formulae of the type of Woolhouse’s cannot be expected 
to produce very satisfactory results, as in the comparatively 
small section of the curve embraced by the foimiula the true 
character of the curve will frequently be obscured by tlu^ 
errors of observation. These formulae are at their best when 
applied to a table based upon fairly extensive data, and 
presenting a curve without any rapid change of character. 
The advantages possessed by the graphic method in dealing 
with a small experience, owing to its flexibility and its })ower 
of bringing under contribution large sections of the curve at 
once, are, however, still more noticeable when frequency 
cmwes can be suitably employed. 


We have already spoken of the success or Ksufficiency of a 
graduation, but we have not said anything as to wind is the 
proper test of a successful graduation. Before dealing witli 
the general principle of graduation by means of frequency 
curves, it will be useful to consider this question. There 
are obviously two conditions that should be fulfilled by a 
graduation. In the. first place, a smooth and continuous 
progression in the graduated values. This is required because 
we have good reason for believing that if the true values were 
ascertainable, they would exhibit this property. In tlie 
second place we require an adherence to the original data, 
sufficiently close to be fairly within what we may conveniently 
term the errors of observation. 

The standard of smoothness is not easy to define. If a 
formula is adopted representing the ultimate values of 
<lx, or fXx as a function of the age, this in itself secures 
a smooth se^^ies.. In other cases the sufficiency or otherwise 
of the graduation in this respect must be left to individual 
judgment.- The advantages of a really smooth curve are 
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mainly found where it is necessary to resort to interpolation 
or to the use of summation formulae ; and, further, in the 
practical consideration that with a really smooth curve nearly 
all tables calculated therefrom can be suiSiciently checked by 
differencing. 

As regards the second requirement, that of adherence to 
the general features of the ungraduated experience, it is 
easier to set up a criterion. We have already seen that if 
the true value of the probability of an event happening at a 
single trial is the event will, on the average, happen np 
times in n trials, and if there are series of 7 ii, 012, &c., 

trials in which the probabilities of the respective events are 
Ply P2, Ihy &c., then on the average the total number of 
occurrences in such a series of trials will be nipi - 1 - 712^2 + 
+ j &o. That is to say, if the observed occurrences 
are O2, ^3^ &c., then the average value of each term 

(< 9 i— 7 ii^i), {02—n2p^, &c., and consequently of the sum of 
such terms, will be zero.* It is also obvious that the average 
value of the sum of the series ( 0 i— 7 iipi) 4-2 (^0—772^2) + 
3(^3— 72.3^3) +, &c., and generally of the series whose rth 
term is 

will be zero. In the case of a mortality experience these 
quantities (^1—711^1), &c., represent the deviations of the 
observed deaths at each age from the Expected Deaths 
as computed by the true rates of mortality, supposing these 
to be known. It follows, therefore, that we should expect 
the total of such deviations 07 i the average to be zero, and 
ill the same way the average value of the successive sums 
of the accumulated deviations should be zero. Generally, 
if we put 

27 ^= 7 ^o^“ 7 ^l + 7 ^ 2 ^- 7^3 + , &C. 

SS 7 ^ = S^?^ = 7 ll 4 - 27^2 + 37 ^ 3 + , &C. 

= 712 + 4 - 6714 + , &c. ; 

we shall have on the average 

%^{9r--nrPr)^0. 

* Tliis is not the most prolahle value of these terms, altlyJugh in general 
it will he very close thereto. The Actuary, however, requires to consider 
the a'cerage result, not the most probable. 
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We should not expect (assuming the true values of to 
be known) that these sums of the deviations of the actual 
from the expected numbers would actually be equal to zero 
in any given case^ but we should expect in a long series of 
cases that the positive values would approximately balance 
the negative. We do not expect to obtain exactly 1,000 
heads in a series of 2,000 tossings of a coin, but we should 
expect to find that the average number of heads over a great 
number of such series of tossings would be very close to that 
figure. This reasoning leads us to the conclusion that, 
given a successful graduation, we should not only have 
obtained a smooth series, but that the sum of the deviations 
between the computed events (deaths or otherwise) and 
the observed numbers, would be nearly zero, and that 
the successive sums of the accumulated deviations would 
be small. 

It is not necessary in practice that this test should be 
pushed too far. We may be satisfied if the sum of the 
deviations and the sum of the accumulated deviations are 
practically zero ; if the total deviations in successive sections 
of the table {e.g., in quinquennial or decennial groups) appear 
to be, on the whole, within the limits of the errors of 
observation ; and if the total of the accumulated deviations 
changes sign fairly frequently. On the other hand we should 
expect that the total deviations irrespective of sign should 
hot be materially less than their theoretical amount. 
Otherwise we should conclude that the series was under- 
adjusted and that accidental fluctuations in the curve had 
been incorporated as inherent characteristics. 

These tests of a graduation are well known to Actuaries, 
and, indeed, have been very generally employed by them. 
So far as they go, they correspond to the method of moments 
which Prof. Karl Pearson has elaborated and employed with 
such success in the fitting of frequency curves to statistical 
data. It is clear, however, that they can only be employed 
systematically in conjunction with those or other curves 
capable of analytical expression. Using methods of gradua- 
tion, based upon Pinite Difference formulae, such as 
Woolhouse's, we cannot secure that the successive sums of 
the deviatiQ;ns shall vanish, though in general we may expect 
them to be small. Using the graphic method, we can, by a 
gradual process of hand-polishing the curve, reduce the 
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accumulated deviations and their sum to as small a value as 
we please;,^ hiit the process is a tedious one. 

A second test that has occasionally been applied when the 
graduation has been effected by means of a formula^ is that of 
making the sums of the squares of the deviations a minimum^ 
the deviations being either in respect of the graduated and 
observed deaths at each age or those of the graduated and 
ungraduated values of some function such as or log.4^ 
This method^ known as the method of Least Squares is 
used very generally in connection with measurements in 
astronomy and other physical sciences and has given rise to 
a quite extensive literature. It is based upon the assumption 
that if in a given series of observations the relative frequency 
of an error x at each observation is represented by the 
function then the probability of a conjunction of any 


* It may, perhaps, be worth pointing out that if we have obtained a 
smooth curve with a general confonnityto the original facts, but not making the 
:S (deviations) or 5- (deviations) vanish, this may be done by the following plan. 
Assume, for the sake of illustration, that the function graduated is the central 
•death rate m^. Eepresenting by mx the graduated values of that function Joy 
'Ex the “ Exposed to Risk ” in the middle of the year of age and by 6^ the 
observed deaths, let 

♦ 2{mxEx-ex)^A 

:s:\mxEx-ex)=B 

then, if w + (1 + h)mx be the modified rates required, 
a . 2(Ea:) + 52(ExWa;) == - A 


a . 5-(Ea:) + Z'22(ExWx)== 
whence a and h are determined. 

If the table on the whole follows Makeham’s law the use of this form of 
correction enables us to neglect all orders of differences in the preliminary' 
adjustment of or fXx- Formulae may thus be emxoloyed (as for example, a 
simple double summation in groups of 10 values, or, still better, successive 
summations in lO’s, 5’s and 2’s) giving a much smoother cur\'e than when 
account has to be taken of second differences, the resulting systematic error of 
this first graduation being corrected as above. 

In the alternative, if 

a2(Ex) + &2ar(Ex) = - A 

This method may be emplo3^ed in conjunction with Mr, Lidstone’s x)lan of using 
a standard table as a bass line for purposes of graduation, 

I) 2 
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set of errors Xi, x.,, x-i, &c., will be proportional to tlic value 
of tlie product 

, he., 

/;>;x+a.’s+.xs+ . . .\ 

=:e~\ " c" / 

wliicli clearly lias a maximum value when the index of e is 
numerically a minimum, i.e., when the sum of the squares 
of the errors (.ri + ajn + + &c.) is the least possible. This 

expression assumes that the average error, and therefore the 
probability of a unit error, in each observation is the same, 
an assumption which may often be fairly made in respect to 
independent measurements of a physical quantity. If the 
observations are not of the same weight, so that the 
probability of the errors of aji, a’2, &c., in the respective 

measures are 

Q-rila- , 

then the most probable solution will evidently be that which 
makes the sum of these exponents the least possible. 

The assumptions upon which this method is based are not 
strictly in accord with the conditions of a mortalitj^experience 
or similar statistical observation. If the method is applied to 
the deviations between the observed and graduated deaths, 
the objection may be raised that the observations at different, 
ages are not of equal weight, and that the probability of a 
unit error varies at each successive age, while in each cast^ 
the probability of a given error can only be approximate]}' 
expressed by the normal function ^ positive and 

negative errors not being equally probable. It is, of course, 
possible suitably to weight the observations, so that a, 
unit error is made equally probable. For example, if at 
any given age there are n ‘^'exposures'’", and if the true 
pr obability of death is g, th en the ^'standard deviation'' or 
y average square deviation and the probability 

of a difference of x between the exj)ected and observed 
deaths is approximately • the error in the formula 

when X is positive nearly compensating the error when x is 
negative^^ Hence, if the ^'Exposed to Eisk" and ^'Died" at 
each age are multiplied by the factor — 3)] where q 


* See Note G, p. 117. 
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is to he taken at its true or graduated value , then the 
observations may be considered to be properly weighted for 
the application of the method of least squares. 

We shall see in the following lectures that there is an 
intimate relation between the criteria of least squares and 
moments. This will be better discussed after considering the 
question of frequency curves and the process of fitting them 
to a set of statistical observations. 

The inigraduated values of q cannot be used, as tliis would result in undue 
■weiglit being given at all ages where the observed mortality was in excess of the 
average, and insufficient weight “where it was in defect. Consequently, the 
mortality table resulting from this process Avould on the whole overestimate the 
mortality throughout. In other words, the use of. the unadjusted values of q 
introduces a systematic or “biassed” error into the calculations. If this is 
aA’oided, however, a very rougli approximation to the graduated curve of q will 
give weights sufficiently near the 'tnith for practical ]purposes, as a slight change 
in. the relative weights of a given series of observations torodiices 1mt little i*esult 
upon the iinal solution. 


THIRD LECTURE. 


I PEOPOSE in tlie present lecture to consider generally the 
use of frequency curves in relation to actuarial statistics. 
We have seen that the graphic method of dealing with these 
statistics^ as also methods based upon finite difference formuhie, 
assume only that the true law of the series, if known, would 
he found to be represented by a continuous curve amenable to 
the ordinary processes of interpolation. It is often possible, 
however, to see that the ungraduated series can be well 
represented by a curve of a certain distinct character, and 
when this is found to be the case more satisfactory results are 
obtained, particularly where the data are few, by fitting to the 
original series a curve corresponding to its observt^d general 
character, so determining the constants in the equation, of the 
curve as to secure the closest agreement with the ungraduated 
curve. If for example we turn to the series in column (2) of 
Table I, it will be at once seen that the general character of the 
series accords very closely to the ^^normaL^ frequency curve, 
or to some curve having the same genei'al features. When 
we find that, by giving suitable values to the constants, a 
frequency curve can be made to fit the observations within 
the limits of the errors of observation we may be satisfied that 
the graduated curve thus produced is probably a better 
representation of the original than any that would result from 
a graphic or finite difference method of graduation. 

Any curve which exhibits the law of variation in a 
particular function, such as a table of or may be 

considered for our purpose as a frequency curve. The 
expression is usually, howevei', confined to that class of curves 
which experience seems to show to be specially applicable to 
the observed distributions of deviations from mean values in 
statistical tables. We have already seen examples of such 


tables where the frequency of the deviations of measures from 
their mean value follows certain comparatively simple laws. 
Professor Karl Pearson has examined a considerable vai-iety 
of statistical data (mainly^ but not entirely, biological) and 
finds that in practically all the cases examined the distribution 
of the various measurements may be represented fairly closely 
by one or other of the class of curves derived from the 
differential equation 

1 dy hx—x^ 

y dx a—bx—cx^ ^ ^ 

where x represents the magnitude of a given deviation 
from the mean of a series of measures and y the frequency 
of such deviation. 

As this group of curves is of considerable importance, 
though less so perhaps in relation to actuarial than in relation 
to some other classes of statistics, it is convenient to consider 
them first. It is not necessary here to discuss these 
curves analytically ; the student may be referred to the 
original papers of Professor Karl Pearson"^", or to an 
admirably condensed resume by Mr. Robert Henderson in 
the Journal of the Actuarial Society of America^ reprinted 
J.I.A.y xW, 429-442; and to Mr. W. Palin Elderton^s ti*eatise on 
Frequency Curves and Correlation in which Professor 
Pearson^s methods are fully described. The table at the end 
of these lectures, which gives a sufficiently complete summary 
of such of the algebraical properties of these curves as 
are most useful in practice, .is, with some unimportant 
modifications, based upon that given by Mr. Henderson in 
his paper. It will be sufficient for our present purpose 
to give a brief general description of these curves and of 
their use in connection with actuarial data. 

We have already seen that the general character of curves, 
such as those of Tables I and II, is approximately determined 
by the average value of the squares and cubes of the 
deviations of the variable from its mean value ; the former 
giving a measure of the compactness or diffuseness of the 
cxxrve that is of the average extent of the deviations from the 
mean irrespective of their direction ; the latter a measure of 
thqir departure from symmetry, or of the skewp.ess of the 
curve. It will be useful at this point somewhat to extend 


Phil. Trani^.j vol. 186, p. 343 ; vol. 197, p. 443, &c. 
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this general statement, and, before proceeding to a description 
of particular curves, to explain more in dcdail what is 
meant by the ''moments'' of a curve. 

If we suppose "fco represent the e(|uati()n to a 

given curve, x varying between the limits h and fc, the 
total area of the curve will be represented by I, lie 
expression : ^ 

area=j ydx. 

We may suppose, for instance, to give dciiniten(:‘ss to our 
ideas, that the function y represents the nuinbc'rs under 
observation between age x and the number of "years 

of life" observed between these ages being ydv, and the area 
of the curve, the sum of all these quantities, being the tota.l 
years of life observed at all ages. If wc now midtiply 
each value of ydx by the corresponding age x and, divide the 
total of these products by the total numl)er of tlic " (exposed ", 
we shall have the average age of the wliole. but into 
symbols : 

j xydx-^ \ ydiT = average value of .r. ... (2) 

= lst moment of the cuiwe romid 
the ordinate for whic*h = 


= mi,say. 

Similarly, 

^ x‘^^y,dx-^\ average value of 

=r/.th moment roimd ordinate for 
which tfi=0. 

=m«. 


The moments of the curve may be taken round any 
ordinate we please. If, for example, the average vjiliie 
of X as found by equation (2), is Xi, then tbe ordinate 
corresponding to this value of x passes through the centre 
of gravity of the curve, and is termed the "centroid vertical,'^ 
In general it is most convenient to take the value of the 
moments of the curve round this centroid vertical, for which 
obviously th^ first moment vanishes. The expression for the 
^^th moment round this ordinate then becomes : 

{x^Xi)^ydx-^\ ydx—jxn • • • 

J it jf- 


■ - ( 3 ) 
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the average value of the nth. power of the deviations (x-- xi) 
between the values of x and the mean value. When the 
moments of a curve are spoken of without qualification^ 
it will be understood that they are the moments round 
the centroid vertical/^ These moments are^ of course^ 
those already referred to in Lecture p. 7^ as representing 
the sums of the powers of the deviations of x from its 
mean value. 

The following formulae^ which may be readily demonstrated^'^ 
connect the values of the moments round the centroid 
verticaL^ with the moments round the ordinate for which 
= Using the same notation as above_, we have 


/ z-o — '^^^0 — 1 

/4i = 0 


^42 = ^2— (mi)2 


(4) 


/43 = niz 3mim2 -f 2 (mj)^ 

/44 = m4 — 4mi m3 4- 6 (^2-1 ) ^^^2 — 3 ^ 

r r * J 


where the law of the coefficients is sufficiently obvious. 

For the particular family of curves arising from the 
differential equation (1) formulas may readily be found 
for the moments involving the various constants of the 
curves^ and inversely^ the values of the constants can be 
■expressed in terms of the moments. The formulae for 
the higher moments being sometimes complicated, it 
is more convenient to tabulate certain functions of the 
moments, e.g. : 


B - 


^2 




_/3.+4 

'y-^7+3 


from which the constants of the curves may be obtained more 
readily, which are also useful in discriminating between the 
curves applicable to a given set of observations.^ 


* See Elderton, p. 17~19 ; Henderson, J.LA., xli, 431-2. 
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Tlie various curves arising from tlie differential equation 
(1) may, for our present purpose, be conveniently classified 
as under : — 



Class I. 

» n. 

Symmetrical curves. Eange limited. 

„ „ unlimited. 



III. 

Skew curves. 

Eange limited in both 

% - - 


IV. 

directions ; 
Skew curves. 

Eange limited in one 

33 

V. 

direction ; 
Skew curves, 
direction ; 

Eange unlimited in either 


tlie various types of curve being as follow. It will be seen 
that some of these Classes are repesented only by a single 
type of curve : 

Class L Symmetrical curves of limited range . — In this 
class we have only the single curve. 


r /yy2\ni 

Typel. y = 

The values of x range from q-a to —a, for either of 
which values of the variable y becomes zero. 

The average value of x is obviously zero, the corresponding 
ordinate y is a maximum, and clearly bisects hhe area Enclosed 
between the curve and the axis of x. In other words, the 
^^mean^^, ^^mode^^ and median of the curve all coincide, 
as in all symmetrical curves. 

The second moment of the curve 

— — 

^ 2m + *6 

and the standard deviation 


The fourth moment 


a 

2m -f- 3 


= jCi4 = 


27)1 + 5 ^ * 


The value of m will usually be positive when y equals zero at 
both limits. If m > 0 < 1 the curve cuts the base-line at an 
angle. If m is negative the value of y becomes infinite at 
both limits, a^d m is always > — 1. 

This curve has a close relationship with the symmetrical 
point binomial curve, whose terms are proportional to the 
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terms in the expansion of (i + 4)% the general term of which 
may be written 



[It will^ of course, be understood that the /c’s in these 
formulae, and in others, are not identical, but 
simply stand for some constant in each case, the 
numerical value of which is determined by the 
area of the curve.] 

The binomial curve, however, can be conveniently used 
only to represent the definite points corresponding to integral 

values of ^ whereas Type 1 represents a continuous 

curve (Note D, p, 122). The data with which an actuary has 
to deal are generally in the latter form, for example, the 
numbers living, the number of deaths, withdrawals, &c., 
between the ages x and x+1, and although usually the 
number of terms in the series is so considerable that the 
curve may be treated as a series of points, on the other liand, 
a binomial having so many terms will not generally be found 
a suitable curve to employ. In most instances where a series 
can be fairly represented by the symmetrical binomial, it can 
also be fairly represented by Type 1, with possibly some 
slight difference in range, as will be seen later. 

There are other symmetrical curves of limited range, 
which are in the nature of frequency curves, but which do 
not belong to the family of curves derived from equation 
(1) : such, e.g.j as the curve 

y=zKe 

which, however, we need not discuss here. 

Glass II, Symmetrical cuo'ves of 'Linlimited range , — In this 
class are two curves belonging to the family with which we 
are dealing. 

Type 2. . (5) 

This is the curve of facility of error or the normal 
frequency curve. 

The average value of x is clearly zero, corresponding to 
the mode or the maximum value of y, and to the median. 

The second moment and iJie standard 
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Type 1 evidently transposes into tliis curve when the 
value of a, and hence the range of the curve, is made 

indefinitely great. If we put =<i-, making lioth «'■* and m 

indefinitely great, but their ratio finite, we have 

(«) 


Liniit 


(1-5) 




Even when the range of the curve is not groat, that is 
-when TO and are not large numbers, there is a fairly close 
agreement between curves of Types 1 and 2 and the symmetrical 
binomiaL 

This may be seen by a numerical examplOj the following 


table showing 

1. The values of ^ = 

these values being proportionate to tlie terms in 
the expansion of the binomial * 


— {qy integral values of 


2 . 

3. 


The values of i/=993 



The values of i/=l,0.26e ^ 


the constants in the two latter curves being choscir t(.) give 
as good general agreement as practicable with tlie binomial 
curve. 


Table VI. 


Allowing Similarity of Types 1 and 2 to the Symmetrical Point 

BinomiaL 


Values of 
Variable 

Binomial curve 

36000 

Type J 

Typo 2 

X 

^ |34-:r|3-a? 

(0 

C-i) 

(y) 

(*1) 

~4 

0 

2 

6 

-3 

50 

47 

56 

-2 

300 1 

303 

282 

-1 

750 

752 1 

743 

0 

1,000 

993 

1,026 

1 

750 

752 

743 

2 

300 

303 

282 

3 

50 

47 

66 

4 

t 

0 

2 

6 

1 Totals 

3,200 

3,201 

3,200 
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Had tlie range of the curves been greater^ the binomial 
being taken to a higher powei% and the values of the 
constants a- and m in col. (3) and of cr in col. (4) been 
larger, the agreement of the three curves would have been 
correspondingly closer. As it is, the two first curves are 
very nearly identical, while the normal curve, although 
theoretical^ of unlimited range, is fairly close to the 
binomial, the terms corresponding to values of x numerically 
greater than 4, amounting to less than 1 in the aggregate. 
It will be noticed that the values of y in the limited curves 
necessarily diminish more rapidly as the limiting values of x 
are approached, while the normal curve is less flat in the 
centre. 

Types. 2/ = ^(i+^2) (7) 

This curve, which is also symmetrical and unlimited in 
range, diverges from the normal curve in a direction opposite 
to Type 1, the values of y diminishing, when x is large, more 
slowly than in the normal cmwe. The curve transposes into 

the latter (Type 2) when and m are indefinitely large, ^ 

being, however, finite. We then have 

Lt. fc\l + J ” . 

The average value of x in the curve y is 

zero, corresponding again to the ‘^mode^^; the second 

moment = ^ and the standard deviation 

2m — 3 

= — - - - — . The fourth moment and, it is 

v2m—3 Zqu — o 

5 

clear, becomes infinite unless m> Indeed, the higher 

moments of the curve must become infinite whatever be the 
value of m. 

The classes of symmetrical curves are of somewhat limited 
application to actuarial statistics, although there are certain 
cases in which they represent the observations fairly well. 

Class III. Skew curves. Range limited in both directions . — 
There is only a single curve of this class in the family of 
curves we are considering, namely : 



Type 4. 


( 8 ) 
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The values of x range from —a to +a; the mode is 
at x=^— — —*a, for which value y is a maximum* the 

mi + ^2 

mean value of x is • a- The expressions for the 

mi + m2+2 

moments of the curve are simplified by putting it into the 
form given in the table on p. 140. If we write mi =np — ly and 
m 2 =nq--l (where p-hq = l), the equation to the curve 
(which does not^ of course, change in character with this 
transposition) becomes 

(») 


the variable having the same range of values — a to -}- a, the 

{q—p)ct; the average value of 
4ipq 
1 


mode being at x 
x-{q 


2 

p)a; the second moment 


* a, and the 


standard deviation the square root of this quantity. 

When this Type evidently transposes into 

Type 1, and thence into Type 2 when m is infinite. 

This curve is related to the skew point binomial arising 
from the expansion of {p-^qj^^ where p and q have 
approximately the same values as in equation (9), and 
where the index of the binomial is not toor small, feere is 
a fair numerical agreement, as may be seen in the following 
table, where the figures given in col. (2) are proportional to 

2 \^ 

the terms in the binomial expansion of + gj : — 

Table VIL 


Showing Numerical Similainty of the Curve of Type 4 with the 
Skew Binomial. 


Value of 
Variable 

X 

Binomial curve 

5760 

Type 4 

y = K(4.-7S -a;)«H(6-25 + a:)”™ 


(1) 

m 

(8) 

-4 

0 

0 

-3 

1 

1 

— 2 

12 

13 

-1 

60 

61 

0 

160 

159 

1 

240 

240 

2 

192 

394 

3 

^ 64 

60 

4 

0 

1 

Totals ... 
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It will be seen tliat for so small a value of n as 6 tlie 
binomial curve can be closely represented by nutans of 
selected points in the continuous curve of Typo 4. When the 
value of n is large^ a much closer agreement is o1)tarina,l)le. 

The skew binoiniiil is of importance to the a.etuary as 
representing the law of the deviations betwinm tlu' jictual 
number of events observed in a given stu'ies of trials and the 
expected number when computed l)y tlu^ triu^ vaJuo 
of the probabilities. Tliere are very maaiy statistical) 
distributions capable of being well represented ])y the 
binomial curve if the latter is treated as a continuous curvtn 
This procedure is not, however, convenient in practice, as it 
rarely happens that the given ordinates coincide witli the 

integral values of x in the general term ^ and, 

^ ” j.r [n — cr-Vc/y ' 

moreover, the analysis, when the curve is t.rc‘a,i.t‘d as 
continuous, is not very simple. {See Note D, p. 122.) 

The form of curve corx'espouding to ^I\ype -t vari(\s vi'ry 
considerably with certain changes in tlu^ vahu's of tlie 
constants mi and ma. In its more usual form, wluui bolh 
mi and are >1, as in Table VII, the curve Ix'uj's a, 
general resemblance to the ago distribution oT ilu' 
^^entrant^^^ in mortality, or similar expericmci^ {see 
Table II), also to the numbers of the exposed to risk ; to 
the number of marriages, or to the rat(^ of nuuTiago at 
various ages; to the average number of childnm under ages 
or to the cost of their pensions at the death of llu^ fa/tlu'r, a 
function of use in pension fund valuations; to the lunnbeu’ 
of retirements in such funds whore superajuiual.ion occurs 
on invalidity and not at a specified age; to tln^ incidence 
of attacks, or of mortality, from certain eliseasos, &c. Owing 
to the number of constants involved (a,s the iiuu-(uu(‘nf< of r 
may represent any period of time, there a, rc^ virtually 
the curve is very adaj)tal)]o. 

It will be readily seen that if the values of boUi vxj and 
7ru2 in ecpiation (8) are high th(^ (uirvi^ ina.k(\s vtu’y clost^ 
contact with the axis of m at either limit; if a/a lies 

between 0 and 1, the curve meets the axis of ,r atian anglt- 
whereas, if either or betli of them are negalivt^, tlu^ c^^:p^^‘ 4 .sic^ 
becomes infinite at oiu^ or both limits, 'riie ar('ai>r tlu^ curvi^ 
and the moments do not, however, become inlinit(‘ if bcfli vii 
and m 2 are greater than —1. 
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Cla$s IF. Skeiv curves. Range limited in one direction . — 
There are two curves of tins class. 

Type t), = (10) 

wliicli is a limiting form of curve No. 4t, tlie values of x 
ranging from 0 and co . 

The ^^niode^^ is at a? = ?? 2 a; tlie mean value of 'x is 
(m4-l)a; the second moment (m + l)a^; and the third 
moment 2(m+l)a^; these being sufficient to determine the 
constants. 

In the usual form of the curve, that is when m>l, this 
curve represents fairly well some of the statistical distributions 
represented by curve No. 4. Owing to tlie feature that as x 
becomes large the successive terms have a ttodency to run 
into a geometrical progression, it is not so well suited to such 
distributions as that of the exposed to risk where the effect 
of the rapid rise in the rate of mortality at the older ages 
makes itself felt in an increasingly rapid diminution in the 
values of y. This is somewhat unfortunate, as the curve is a 
simple one, determined by the values of its first three 
moments, and except for the reason stated, well suited for use 
ill connection with Makeham^s formula for the force of 
mortality. 

As in Type 4, the character of this curve may be entirely 
changed by an alteration in the values of the constant m. If 
this constant vanishes the curve becomes a diininishing‘ 
geometrical progression ; while for negative values of m the 
curve becomes infinite at the lower limiting value of x. The 
value of m must in any case > — 1. 

The actuary has to deal with several distributions roughly 
similar to a diminishing geometrical progression as, for 
example, the curve of infant mortality, the rate of withdrawal 
in successive policy years, or the difference between the select 
and ultimate mortality rates in a select mortality table. Other 
expressions giving a similar form of curve may be employed to 
represent these distributions as, for example, y=^/c + 

with a minimum value of xa when x is very large ; or 
y:=/c{x-ha)~'^\ where if a is small we have a curve again 
similar to that of infant mortality, x representing the age. 

’''(^ 4 - ]) ... . . . ( 11 ) 

where the limiting values of x are a and cx) , with an 
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average value of mode occurring 

at aj= — a. The expressions for the moments are much 

7^2 —mi ^ 

simplified by writing the equation to the curve in the form 
given in the Table on pp. 140-1. 

Type 7. y = (12) 

Where x varies between 0 and oo ^ having an average value of 

— — . with the ^^mode^^ at x=—. The second moment 
m — l m 

Ijl 2 =- TTTijT rrr ^^<3. the standard deviation 

^ (m — 2)^(m--o) 

conse,nen%=^— 

Here m must be > 3, or the second moment becomes oo 
and the fourth moment becomes infinite unless m is greater 
than 5. 

Neither this nor the preceding curve are of any wide 
application in actuarial statistics^ owing to the fact that the 
values of y for large values of x diminish with increasing 
slowness ; m features not often met with in practice except in 
such a function as the rate of withdrawal.^^ The same 
remark holds good of the single curve constituting Class V, 


Glass V. Shew curves. Range unlimited in either direction. 

Types. + y (13) 

This is the only skew curve of this family having 
unlimited range. The average value of x= 

^^mode^^ is at ^a, 

2m 

The expressions for the moments and their functions are 

simplified by writing 4- 1^ for m in equation (13)^ as in 

the Table on pp. 140-1. For the reason stated above;, the curve 
is not specially useful to Actuaries. 


E 
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Assuming that a given statistical series can be represented 
bj one or other of the curves above described^ the a,ppropriate 
curve can be found by means of certain criteria based upon 
an examination of the moments of the curve; that is to 
say^ the sums of the powers of the deviations from the mean 
value. These criteria are furnished by the table on pp. 140-1^ 
above referred to. 

As the calculation of the criterion is somewhat lengthy, 
it may be noted that if the logarithms of y are tabulated 
for equal intervals of the variable Xj and the values of 
A^logy taken out, these give us information as to the 
nature of the curve. The value of A'-^logy will be 
constant and negative for the normal curve Type 2 ; 
negative and symmetrical with a minimum numerical value 
in the centre of the range, for TyjDe 1, or for any binomial 
curve; uniformly negative, non-symmetrical, and with, a 
numerical minimum in the case of Type 4 (where tins 
curve vanishes at the limits) ; and unifoimily negative and 
continuously decreasing towards the upper limit of x in the 
case of Type 5, where this curve vanishes at the limits. 

In the case, therefore, of those curves most useful to the 
Actuary the function A^ logj/, computed for the ungraduated 
curve, enables us to select generally the fcgnnula m 4 )st suited 
to the series. For this purpose if the data are grouped it 
will generally be better to compute the approximate values 
of the central ordinates of each group by an interpolation 
formula, such as that given on p. 57. 

Other types of curves will sometimes be found useful 
besides those arising from the differential equation on p. 39 ; 
but they do not generally lend themselves so readily to the 
method of moments. 

If, for example, we write 

m 71 \ 

a+aj*^&+aj/ 



we obtain, when m and n are numerically unequal, a skew 
curve vanishing when —a or —6. We may deal Avith this 
curve in practice by determining the values of equidistant 
ordinates as shown on pp. 57-8. Thus 




\ogy=K 


m n 
a+x 6 + 03 


(15) 
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As logy becomes — co at the limits, we multiply both sides 
by [a + x){h'{-x)^ thence 

w\ah-\-{a+'b)x-\~x^'] 

= /c'[a6+(a + Z))a? + .^’“] —m{h + x)‘-n{a^x) 

= A + Baj + C^'^ (say) (16) 


where the unknowns are a, b, A, B and C. 

If we difference three times the right hand side vanishes 
and we have a series of expressions involving (ab) and (aH-fe) 
equated to zero and by suitably grouping these, or by using 
the method of moments a and 6, and thence the remainino* 
(Constants, may be evaluated. 

A similar process maybe employed with advantage with a 
eurve such as the usual form of exposed to risk or died, when 
the data are in large age groups. We may then take w in 
equation (15) to represent the common log of the ratio of the 
numbers above age x to the numbers below age x in the series. 
That is, if the total number in the series =:lSr, the number 
.above age x =Y, we may write 






m 


n 


CL X h 


(17) 


In many cases the constant K' may be omitted if the 
number of groups is small; in this case C in equation (16) 
becomes zero. On the other hand it may sometimes be found 
necessary to add a term to the right hand of equation (16) 
involving 


E 


2 


FOURTH LECTURE. 


We shall now consider very shortly the problem of fitting 
frequency curves to statistical data. To do this at length 
would be impossible in the time at our disposal^ and the 
student who wishes to pursue the subject in detail may read the 
original papers^ already referred to (p. 39), of Professor Karl 
Pearson, to whom the development of the subject is due, 
or Mr. Elderton^s book. There are certain general principles 
however, which may be usefully considered. The method 
usually employed in fitting these curves is by making the 
moments of the graduated equal to those of the ungraduated 
curve, which is 'equivalent to making the quantities 
S (deviations), (deviations), &c., as fa/ as S'* o^ equal 
to zero. This method may not always be the most convenient 
or the best for the purpose of the Actuary, but it is so 
for most statistical purposes, and has come much into use 
accordingly. 

We have already seen that, in the case of the curves 
arising from the differential equation on p. 39, expressions 
for the moments may be obtained in terms of the constants 
which will enable us to determine the value of the constants, 
when the numerical value of the moments is known. For the 
purpose of fitting the appropriate curve to any given series 
of observations it is only necessary to determine the value 
of the moments as given by the observations, that is, the 
value of the sum of the squares, cubes, &c., of the deviations 
from the mean value of the variable. 

It will be useful to consider shortly the calculation of the 
numerical value of the moments in a given instance. Take 
first the sknplest possible case where we have to do not with 
a continuous curve, but with a series of points representing 
isolated ordinates, where in consequence we replace Integra- 
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tions by summatioiis. In tbe following table, the first column 
contains the values of the independent variable x, the range 
of values being from 0 to 6. The second column contains 
the values of its function y, which are proportionate to the 


successive terms in the expansion of the binomial 



the constant multiplier 729 being introduced merely to avoid 
fractions. The remaining columns, in which the average 
value of X and the values of the successive moments are 
worked out, explain themselves. It may be remarked that in 
this example the average value of a?, and the deviations from 
the average, are all integral, and it is therefore convenient 
to calculate at once the moments round the average value 
centroid vertical In most cases, however, the average 
and the deviations will not be integral, and then it will 
be more convenient to calculate the moments round the 
origin or some selected middle value of the variable, 
afterwards transferring the moments to the mean by the 
formulae given on p. 41. 


Table VIII. 

Moments of the JBoint Binomial Curve. 
729 . 

15 


|6— ^\3/ \d) |^j6— a; 


X 

y 


{x-A)y 

{x-4)-y 

(ar-4)V 


0 

1 

0 

- 4 

16 

- 64 

256 

1 

12 

12 

- 36 

108 

-324 

972 

2 

60 

120 

-120 

240 

-480 

960 

3 

160 

480 

-160 

160 

-190 

160 

4 

240 

960 

0 

0 

0 

0 

5 

192 

960 

192 

192 

+ 192 

192 

6 

64 

384 

128 

256 

+ 512 

1,024 

Totals 

729 

'2,916 

0 

972 

-324 

3,564 

Totals 

1 

4 

0 

4 

4 

44 



mean value 


3 

y 

0 

-f-729 


of X 






Obviously, when the moments are calculated about the mean 
the first moment is zero (because it represents the average 
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deviation from the average value). The even moments are 
always positive^, because each term is of the form 
i.e., essentially positive ; and if the curve is symmetrical the 
odd moments vanish^ because each term of the form is 

cancelled by a term (equidistant from the mean) of the 
form In general^ where the curve is not 

symmetrical, the third, fifth, &c,, moments will not be zero. 

In the above illustration, we have considered x to have 
integral values only. This may be said to approximate to 
the conditions of many statistical tables used by the Actuary 
where x represents the year of age under observation, and 
where it is indifferent whether the observations are supposed 
to be spread over the year in the form of a continuous curve, 
or whether -we consider them all to have reference to the 
central point of the year. In these cases, however, x will 
generally have a large range of values, amounting possibly 
to 60 or 80, and the labour of computing the numerical 
value of the moments is then much lessened by grouping 
the facts in larger sections, though we cannot then safely 
assume the totals of each group to be concentrated at the 
middle ordinate. 

Take the set of observations in Table IX representing 
for decennial age groups numbers exposed to rispk in the 

middle of each year of age, i,e., E^=Ea.--“ in the recent 

mortality experience of lives assured by ascending premium 
policies,* excluding the first ten years from entry. Here we 
have no longer the values of equidistant ordinates of the 
curve, but the area of the curve enclosed between successive 
ordinates. To obtain the moments of the curve with any 
degree of accuracy, we cannot treat these areas as 
proportional to their central ordinate. 

It will be noticed that the particular curve we are dealing 
with becomes gradually zero at either extremity,t and we may 
assume, without serious error, that it makes close contact 
at either end with the axis of oj, that is to say, is 
asymptotic thereto. In these cases, Mr. Sheppard has shown J 
that very approximate values for the moments may be found 

* See Unadjusted Data, Minor Classes of Assurances, p. 191. 

"f We omit t^e numbers at risk under age 25 (arising from entrants under 
age 15), amounting to only 25 in all. 

X An elementary demonstration is given in Elderton’s Treatise, p. 28-29. 
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by treating tlie area of eacb successive section of tlie curve 
as concentrated in tbe middle ordinate of tlie section; in 
otlier words^ treating tlie values of y as representing isolated 
ordinates exactly as was done in Table VIII; andL.tlien 
applying to the values of the moments so found (denoted by 
the symbol m^) the following adjustments leading to the 
corrected moments denoted by the symbol m : — 


7^1 = m'l 

, 1 

= m 2— j2 

/ I / r J- 

== m 3 — j m 3 r mi 

4 4 

, 1 , 7 ,1 1 


For moments round tie centroid vertical tiese become^ 
remembering that 

, 1 
12 

» tM3=fi'3 

, 2-“* 80' 


Table IX. 

Ascending JPremium Assurances — Uwperienee 1863-1893. 
Duration 10 years and upwards. 


Calculation of Moments of Exposed to Eislc ’’ Cur^e, 



Exposed to 






Ages 

Risk 

V 

X 

xy 

x-y 

x^y 

x^y 

25-35 

2,874 

-2 

- 5,748 

11,496 

-22,992 

45,984 

35-45 

22,020 

-1 

-22,020 

22,020 

-22,020 

22,020 

45-55 

26,164 

0 




55-65 

17,391 

1 

17,391 

17,391 

17,391 

17,391 

65-75 

7,845 

2 

15,690 

31,380 

62,760 

125,520 

75-85 

1,761 

3 

5,283 

15,849 

47,547 

142,641 

85-95 

81 

4 

324 

1,296 

5,184 

20,736 

Totals 

78,136 


10,920 

99,432 

87,870 

374,292 

Beduced 

1 


•13976 

1-2725 

1-124^ 

4-7903 

fco unit area 

X 



=m% 


= m' 4 , 
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rrom these results we obtain by means of the corrections 
above stated — • 

?ni = *13976; m2=l*1892; m 3 = 1*0897 ; m4=4*1832. 


Whence^ by the equations on p. 41. 

1*1697; /^3 = ’5965; /44=:3*7122. 

If quinquennial age groups had been used, making due 
allowance for the unit of time still being taken as ten years, 
the corresponding values would have been 

mi=:*13848; /^2=1*1741; /t3=*5869; /t4=3-7160. 


using these latter values, as the more accurate, we obtain for 
the values of the functions jSi , ^2 ^^ 6 . 7 . 

-21283; A=/^4/a‘%=2-6957 ; 7 = =-7397; 

2 

As /X 3 does not vanish, and 7 is > g, we see from the table 

on pp. 140-1, that if the series can be represented by any of 
the curves there given, it must be by No. 4, excluding the skew 
binomial as unsuitable for reasons already given. It is also 
obvious from the run of the figures in Table iX that the 
curve is limited in both directions. Equating the expressions 
in Table IX with the above numerical valuOs, we have 




3 + 2 
O _ 4(^ + 1) 

in+2rpq 


whence 


•21283 ; 
(jp — g)“=*5453p5r 


(p + g)2=4*5453pg =;1 (since y 4-9' = 1) 

giving _p=-6732; q = -S268 

M^= — • a®=l‘1741 ; whence a=3'293 

71+ L 

thus giving a range of 32*93 years on either side of the age 
for which the value of x in the forinula=0. This has nothing 
to do with the zero point (age 50) in Table IX. The mean 
age as is seen from that table is 50 + 1-385 = 51-385. The 
value of m, tne mean as computed by the above formula^ is 

m, = (g — p) a = — 1 • 1407 
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that is, 11*407 years earlier than the central point of the range, 
giving for the latter, 51*3854- 11*407 = 62*79, say. 'I’he range 
of the curve is therefore from age 29*86 to age 95*72 ; and, 
computing the values of np — 1 and nq—1, we have, for the 
final form of the equation of the curve, when aj = the age : 

^ = /c . (a? -- 29*86) '-^(95*72 


It is often a convenience, however, to have the values 
of the central ordinates of the groups, which may be 
approximately obtained by interpolation. If the numbers in 
any group are represented by the symbol the number of 
years in each group being t, the value of the central ordinate 
of the group (that is to say, the numbers under observation 
exactly at the central age of the group) will be approximately 

As, however, it is convenient to treat the 


interval t as the unit, for the time being, we may write as the 
values of the central ordinates (the original 


numbers for each group less -g^th of their respective central 
second differences). In the class of curves we are discussing, 
namely, those having close contact at both ends with the axis 
of X, the» numerical values of the moments • as deduced from 
these ordinates will be very nearly the values for the 

continuous curve, unless the number of ^ groups is very 

rh 

small. Thus the values of ydx, and of the functions 

xydx, x^^ydx, will be found by taking the sum of the 

ordinates of y, computed as above, and the sum of the 
products xy, x^^y. 

An advantage attaching to the use of ordinates in lieu of 
areas is that, in the class of curves we are dealing with, we 
can, by examination of the differences of the logarithms of 
the ordinates, gain a better idea, of the nature of the curve 
than can be obtained from the grouped figures, [See Third 
Lecture, p. 50.) It is also easier to compare the graduated 
figures as given by the frequency curve by means of isolated 
ordinates than by means of groups or areas. 


* The formula to 4th differences is Uy, ~ + nearly, and in 

24 2l;V 

order that the resulting 4th moment should agree exactly with that obtained 
from the use of the grouped figures, or areas, with Sheppard's corrections, the 
4th difference is required, but for practical purposes it is not often needed. 
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The use of the central ordinates of the groups has 
the incidental advantage, which is very considerable 
in the case of a mortality or similar experience, of 
giving trustworthy values of the force of mortality, or 
corresponding function, for the ages corresponding to the 
position of the ordinates. In the usual plan of summarizing 
a mortality table by giving the numbers at risk and deaths 
in consecutive age groups, the ratio of the deaths to the 
numbers at risk in each group is not a useful function, as it 
does not correctly represent the mortality for the central age, 
except near the middle of the table, where the numbers under 
observation in successive years is nearly constant. 

We may apply this method to the example already dealt 
with on p. 55, viz., the experience of ascending premium 
policies. The calculations as set out in the following tabular 
f orm are sufficiently clear : 


Table X. 

Mortality experience of lives assured ly ascending ^remiums^ 


1863-1S93. Duration 10 yeao^s and upivards. 


\ 

i Central 

1 of group 

: W 

Exposed to 
Risk 

Died 

♦Estimated Central 
Ordinates 

_r. . 

Central Age 

Exposed 
to Risk 

Died 

(I) 

(2) 

(3) 

0) 

(5) 

(6) 

(7) 

25-30 

27*5 

266 

2 

168 

•8 

•0048 

30-35 

32-5 

2,607-5 

31 

2,448 

29-2 

*0119 

35-40 

37*5 

8,788 

102 

8,860 

102*0 

•0115 

40-45 

42*5 

13,232-5 

173 

13,389 

175*2 

•0131 

45-50 

47*5 

13,910 

192 

14,007 

191-7 

•0137 

50-55 

52*5 

12,254 

218 

12,284 

218*6 

•0178 

55-60 

57*5 

9,878*5 

229 

9,878 

228*4 

‘ *0232 

60-65 

62*5 

7,512*5 

255 

7,518 

255*4 

•0340 

65-70 

67-5 

5,007*5 

271 

4,994 

274*4 

•0549 

70-75 

72-5 

2,837 

206 

2,809 

205*6 

•0732 

7o— so 

77*5 

1,347*5 

151 

1,324 

151*4 

•1144 

80-85 

82-5 

413*5 

85 

389 ! 

84*8 

•2180 

85-90 

! 87*5' 

77 

24 

66 

22*3 

•3379 

90-95 

i 92-5 . 

i 4*5 

i 

3 

2 

2*2 

1*1000 

Totals... : 

78,136 

1,943 

78,136 

1,942*0 



A-// 

* Taking 5 years as the unit, computing by formula Ux where 

24j 

Ux represents the nunaber in columns (3) and (4). By this formula there are 
—11 persons exposed to risk at age 22*5; these have been included in the 
group 25-30, 


.’■i ir'lf . J.f).. ^ 
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If tlie values of tlie moments are computed from columns 
exactly as was done with the Binomial Curve (Table VIII, p. 53) 
they will be found to be practically identical with those found 
above. The estimated values of for the central ages of the 
group are inserted as they will be used later. 


In many cases the principle of the method of moments 
may be used to fit a curve to a series of observations without 
actually computing the numerical values of the moments 
themselves, using instead the successive summations of the 
ordinates, or areas, from which the moments can be readily 
obtained if required. This method is also useful if one or 
both limits to the range of the curve can be assumed. 

Consider a scheme such as the following, in which, with a 
view to clearness, we use actual numbers of the series, given 
on p. 53, instead of symbols : — 


X 







X 0?^ 

0 

“ 1 

729 


... 



0 

1 

12 

728 

♦ 

2,916 

7,776 

(6,318) 



12 

2 

60 

716 

2,18 S 

4,860 

9,180 

15,660 

(11,070) 

960 

3 

160 

656 

1,472 

2,672 

4,320 

6,480 

12,960 

4 

2-10 

496 

816 

1,200 

1,648 

2,160 

61,440 

5 

192 

256 

320 

384 

448 

512 

120,000 

6 

64 

64 

64 

64 

64 

64 

82,944 

278,316 


In this scheme, each column is formed from the preceding 
by successive addition from the bottom, in the same way 
that the column is formed from Cj? , and from Ma*. 

If we take the value against x^O in the column say 
27^0 we see that each value of occurs once only in that 
total. In the total appearing against in the second 

summation, say each value of occurs x times; 

similarly the total against x=2 in the colu:^n 'Zhia:, say 


represents the sum of the products 


X (x ““ 1 ^ 




and the 
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total against x^o in the column 2%^;, say 2 % 3 , represents 


the total of the products 


x{x—l){x—2) 


lU 


and. so on^ the 


coefficients following the Binomial law. It is evident from 


this that the sums of the products x\ix 




&c.^ are 


implicitly contained in these totals ; and that if these sums 
of the graduated and ungraduated values are in agreement^ 
the moments of the t^vo curves will also agree. Writing 
as the value of the Tzth moment round the ordinate of x = 0, 
we shall find 




2no 






7723 = 


62^223 -h62% + 2%i 

2z2o 


m4 


242^2(4 4- 362%3 4- 3 42^h^ -f S-iCi 

222o 


These formulae may be simplified if we write th^m in a 
form analogous to central difference formulae — writing, for 
example : 


for 


2^(Wj.j_l + Ux) 
'2 ’ 


these average values being shown in antique type in the 
Scheme. W e then have, omitting the common divisor Xuq : 

7211 = 2-12 1 
m2=22%i| 

??23=62%2 + 1Wi 
7?l4 = 242^112^ + 7712 


The equivalence of the above formulae may be illustrated by 
the following^umerical examples based on the above scheme. 


* See tbe demonstration in Xote E, p. 124. 
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Using U as an abbreviation of Stto=tlie total number of 
observations, we have 

]Sr.mo= 729 = ^“^^0 
N.mi= 2916 = 22^1 

N.m2= 12636=223^2 +^^^1 =2x4860 + 2916 

=223'iiii =2x6318 

]Sr.m3= 57996 = 62%3 +62''U2 +2^^1 = 6x4320 + 6x4860 + 2916 
= 62%2 +22t^i=6 X 9180 +2916 

]Sr . m, = 278316 = 2423z^4 + 362‘'ii3 + 1423ii2 + 2%i = 24 x 21 60 

+ 36 X 4320 + 14 X 4860 + 2916 
= 242H(2?r + 223,^ = 24 X 1 1070 + 2 X 6318 

The last may be compared with the direct calculation of 
cchc^ given in the last column of the scheme. The values of 
the moments through the centroid vertical may be obtained if 
requiredpby the formulae ; 

fMi = 0 

Ij^=iqn2 — (mi)^ 


/is = ^3 — 3 (wi)/i2 — (mi) 3 

/i4 = m4 — 4(7?ii) /i3 — 6 (mi)2^ — ^ . 

Where the number of terms in the series is few, there is no 
special advantage in this method ; but if the number of terms 
is considerable it effects a saving of time, more particularly 
if the calculation of the moments round the centroid vertical 
is not heeded by the conditions of the problem, as in the case 
or the graduation of rates of mortality by Makeham^s or 
any similar frequency formula. 

The case of curves not making close contact with the axis 
of X at both ends requires to be considered separately, but the 
results obtained are not altogether satisfactory, see Elderton,. 
pages 29-30. The difficulty can, however, to a great extent 
be avoided in most cases arising in actuarial ^ork by using 
very small groups, or even individual values for each year of 
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age^ &c., in calculating tlie moments. Tlie labour altbougli 
increased is by no means probibitive if tlie summation method^ 
above described, be adopted. 

Professor Karl Pearson lias sliown"^ that tlie metliod of 
fitting a curve by computing its moments should lead to 
nearly the same results as the method of least squares. If we 
are fitting to a given set of observations an ordinary parabolic 
curve, represented by the equation y=a-{-hx + cxr + &c., then 
the method of moments and the method of least squares are 
identical.t He infers from this fact that, even if y is 
represented by a more complex expression, the numerical 
results from the method will be nearly the same as with the 
method of least squares. It would appear at first sight that 
the effect of the method of moments is to give ec|ual weight 
to each observation or group of observations, in spite of their 
having unequal average errors ,* whereas the method of 
least squares should, strictly speaking, be applied only when 
the average error of each observation is nearly equal. J In a 
mortality table, where the number of persons under observation 
and the number of deaths are relatively large in the middle 
of the table and fall off to zero at the beginning and end, the 
probability of a given error in the value of g is very much 
smaller at the central ages; while, on the^other h^nd, the 
probability of a deviation of a unit in the number of deaths is 
correspondingly greater. The same applies to most tables of 
statistics, as they usually present- a series starting from zero, 
rising to a maximum, and diminishing to zero again, the 
weight of the observations being in the middle of the curve, 
where, however, the probability of a given numerical deviation 
in the actual numbers is also greater. 

We have seen that in a series of numbers representing the 
distribution of a group into sub-groups the average error in any 

given case is approximately *8 a/ — w-here n is the 

number in the group and m the (graduated) number in the sub- 
group. If, as is generally the case, n is large compared to 

* Biometrika, vol. i, p. 266-271. 

t This assumes that the miadjusted moments {m not m') are used, i.e., that 
the niunbers represent ordinates and not areas. If the moments are assumed to 
represent areas and the corresponding corrections are introduced, the method of 
moments no longef^gives precisely the same results as the method of least 
-squares : see examples given hy Todhunter, JJ.A., sli, 414. 

t See Isote C, p. 117. 


# 
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this expression may be taken as equal to *8 \/ average 

error in the ratio ” being approximately Thus^ if 

the number at risk at a given age equals n and the true 
probabilities of death and survivorship^ are q_ and then 
•8 (which as ^ is nearly unity for the greater n innbe r 

of ages may be roughly taken as •SvWmber of deaths)^ 
is an approximate expression for the average deviation from 
the expected number of deaths. The method of moments^ 
if employed to represent a> given series by a parabolic curve, 
assumes an equal probability of unit error in each term of 
the series. If, therefore, the series is of such a character 
that the extreme values are relatively small, these parts of 
the data will have somewhat less than their due weight in the 
fitting process. If, however, the formula to be fitted does 
not represent a parabolic curve, but a curve analogous to the 
normal curve say a curve of the form 

then it will be found that, on the assumption that the mean 
error in any value y is equal to \/ yi (where yi represents the 
graduated value of y) the method of moments gives the same 
result as^the method of least squares when the observations 
are duly weighted {see Note F, p. 129). 

We come now to the class of curves representing not 
the actual numbers in statistical tables, but the ratios of the 
corresponding numbers in the double series, such as those of 
tables of Exposed to Risk and Died curves, that is, 
representing such functions as rates of mortality, of marriage, 
of lapse, of superannuation, &c. The most interesting and 
important of these is the curve due to Makeham^s development 
of Gompertz^s hypothesis, in which the force of mortality at a 
given age x is represented by the expression 

A + Bc^ 

leading to the equation 

logioZ;.=E:+A'aj + B'c^. 

This curve has a double value as, apart from its use in 
graduating a mortality table, it has the valuable property 


* See Note A, p. 110 ; xxvii, 214. 
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tliat the values of annuities on n joint lives of various ages 
eaii be found from a table of single entry showing the values 
of aiiniiities on n lives of equal age. Owing to its importance 
it will be useful to give some attention to the problem of 
fitting this curve to a mortality experience. We tvill first 
consider the case of an aggregate or non-select table^ that is, 
a table in which the rate of mortality is a function of the age 
alone. 

Various methods have been employed to obtain the values 
of the constants A, B, c, corresponding to a given experience. 
That used by Makeham, and subsequently in a modified form 
by Woolhouse,is based on selected values of log taken from 
a table already graduated by a finite difference formula. 
Four values of logZ^ maybe taken, covering practically the 
whole of adult life, say the values at ages 20, 40, 60, and SO, 
or 25, 45, 65, S5. Either set are sufficient to determine 
the four constants, K, A', B' and c, as above. In Woolhouse's 
graduation of the Table, both of these sets of ages were 
employed, the most advantageous values of the constants 
being found by comparing the deviations between the 
graduated and ungraduated values of Ij; at quii>quenmal 
ages according to the two preliminary graduations. If a 
single set of four values of is taken the bag^s of the 
graduation, the effect is the same as emplopng the sums of 
the forces of mortality (/Xjr+.x) between the selected ages, 
giving equal weight to the values at each age. 

• The method employed by IMr. King in the Institute of 
Actuaries’ Text-Book, Part II., substitutes for graduated 
values of log Zj. at isolated ages, the sum of certain 
groups of the ungraduated values of logZ^r. The effect 
of this method would appear to be to give a diminishing 
weight to the values of for the ages at the commencement 
and end of the table, which is so far in accordance with 
theory, and to eliminate the effect of errors in isolated 
values of Z^. In Biometrika (vol. i., p. 298-303) Prof. Pearson 
has dealt with the same problem, basing the values of the 
constant upon the successive summations of logZ^r. 

It is, perhaps, preferable to deal directly with the actual 
exposures and deaths in a manner similar to that first 
described by Makeham (/.I. A., vol. xYi,p. 344). This can 
be readily drone, and the same method of snmmations or 
moments applied as in the case of any other frequency curve. 
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Tabulate that is, the number exposed to risk in the 

middle of the year of age Xj and da» representing the deaths 
occurring between ages x and Assuming, as we may with 

sufficient accuracy for ordinary purposes,* that the force of 
mortality at ag^e tC+i, or the function colog ePxj is equal 

6 

to nix the central death rate w have 

E;„+x(A+Bc^+i) = ^;«. 

If we knew the value of c, Ave could then tabulate the A^alues 
of 6x respectively, and summing these values 

continuously to the end of the table, and again taking the 
total of these sums, Ave should obtain equations in this 
form : — 

A + (S(E.,+.c-+-J) ) B = (te,) 

(SSE^.+.j) A+ (SXE^+ 40 *-"^)B = 

a simple simultaneous equation foi' determining A and B. 
As a matter of fact, the value of logioc does not usually 
differ very much from *04, and in general it Avill be found 
that a suJlIl change in the A^alue of log c does not involve a 
serious change in the general character of tlie table. In a,n 
importanirseries of' observations, hoAvever, we cannot assume 
the A-^alue of c. Either Ave must deteimiine c by a method 
such as that used by Mr. Woolhouse or Mr. King, which Avill 
give a sufficient approximate value, or Ave may adopt two or 
more alternative values of c, Avhich appear likely to contain 
between them the true value. Having obtained the values 
of constants A and B for each given value of c, set out 
the expected or graduated deaths, and compare them Avith 
the actual numbers in suitable age groups. If the 
thii'd summation of the differences of the graduated and 
ungraduated deaths is computed, it Avill be possible by 

* Assuming the usual table of and Bx to represent accurately the facts 
and to be undisturbed at the older ages (where alone the point is of any 

importance) by entrances or by exits other than by death, then f = 2 -^, 

acnrately^ and colog t ? — Ta ^ i very nearly, where mx is the 

0 

central death rate ” , The error caused by omitting the small term 

' 0 

in the denominator and taking colog is only a]Tpreciable at the 

-“a: 

older ages, amounting to .1 per-cent in the rate of mortality where {/a? -'3 
or about age 90. 

P 
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interpolation to obtain a yalue of log c, making these nearly 
equal to zero. Putting the matter into the language of 
moments^ we shall then have made the first, second and 
third moments of the graduated and ungraduated curves 
equal, and in that way we shall have selected what may be 
considered the best values of the constants A, B and c.* 

It may be objected that the use of this particular method 
is open to the same implication of giving equal weight to all 
the observations, as in the case of the values of We can 
avoid that objection by duly weighting the observations at 
each age by multiplying the exposed and ^^died^'’ at 
each age by the approximately graduated values of 
But although this would give suitable weights to the 
observations, if the curve of mortality ivere a parabolic 
curve, or if it were known to follow accurately Makeham^s 
Law, it is not quite clear that it would do so in practice. 
It may be assumed that (when the constants are formed 
by reproducing the moments of the deaths) in not 
weighting the observations, we give less weight to those at 
the commencement and at the end of the table than they are 
theoretically entitled to. But this is not a serious"^ practical 
objection. Makeham^s law is only approximately correct, 
and as we reach younger adult ages it begins to diverge from 
ihe facts of observation ; on the other hand, as we reach the 
older ages the actual importance of the observations is less 
than the weight to which they are theoretically entitled, as 
estimated by the number of deaths, owing to the fact that 
the actual mortality at those ages does not materially affect 
financial questions such as rates of premium and reserves. 

Beyond this consideration there is also a degree of doubt 
attaching to the rates of mortality at extreme ages in any 
table.t Indeed, we may go further, and say that in all 
considerable tables of statistics the numbers at the extremes 
of the table are proportionately more affected by sporadic or 
accidental errors of observation than those in the body of the 
table. If we suppose that in a very small percentage of 
cases the ages of the Exposed to Eisk and Died are 
affected by errors of calculation, clerical errors in tran- 
scribing the data, &c. — ^these cases being removed from their 
true position and scattered at random over the table — the 

* See Kote O, p. 131. 

f See my notes on this siibject in “ Principles and Methods ”, p. 148. 
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effect upon tlie data over tlie g'reat bulk of tlie table will be 
insig'nificant owing’ to the large numbers under observation 
and to a balance of errors, but the effect upon the experience 
at the extremes of the table, where the actual numbers under 
observation are very small, may well be appreciable. 

Eeverting to the problem of obtaining the value of c in 
Makeham's formula directly from the observations, we may 
endeavour to represent the curve of the Exposed to Risk 
by some frequency curve which can be suitably combined 
■with the formula for fjix to represent the deaths — such, for 
example, as the normal curve ^ = or the curve 

No. o, y^hx>^ey^, or by the terms of a binomial expansion 
{see Calderon, JLA.y vol. xxxv, p. 157). Unfortunately none 
of these curves give a very satisfactory representation 
of the average form of the Exposed to Risk'’^ curve. 
In the case of the binomial, in order to get a tolerable 
fit, it will be generally found that the value of oi in the 
Jcof 

expression — (representing the general term of the 


\x\ni’—x 


binomial) must be taken small ; that is to say, the data must 
be arranged in somewhat large groups of not less than about 
10 ages to a group. In either case it will be necessary, after 
obtainiug^a freqiTency curve fitting the numbers of the 
Exposed to Risk,^^ to re-compute the deaths on the basis 
of these graduated numbers. 

Thus, while it is possible to determine the values of c 
directly from the observations, the process is laborious. In 
my opinion, it is preferable to use certain trial values of c 
which we know to lie near the truth, and, by a comparison of 
the resulting graduated deaths with the original facts, to 
select a value which appears to give the best general 
agreement, which may not always be that making the third 
summation of the deviations zero.*^ 


There is a further point to be considered with respect to 
the nature of the differences between the original numbers, 
whether of deaths or of other observations, and the numbers 
obtained by a graduation following a formula such as that of 
Makeham. These divergences between the ungraduated 
and graduated numbers will in part arise from the smallness 
of the numbers under observation, and may in part arise 
from the fact that the formula does not accuiutely represent 

See Note Gr, p. 133. 
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the true curve of mortality. For the majority of mortality 
tables^ for male lives at the adult ages, Makeliam^s formula 
is so near the truth that we may in practice neglect the 
systematic errors and assume that the formula represents 
the true curve of mortality, determining our constants as 
though the whole of the deviations in the graduated and 
ungradiiated curves are accidental and due to the smallness 
of the data, but for some tables, notably those representing 
the mortality of females, this will not be the case. 

Other expressions may be given representing approximately 
the curve of as, for instance, 

fir=rjia^ + ?ib^ ( 1 ) 

whence 

logioZx=K+Mu^ + N&^ ..... (2) 

an expression which enables us to represent some mortality 
tables, such as those arising from tropical experience, that 
are not very readily represented by Makeham'^s formula. 
The values of these constants can be readily obtained either 
from 5 selected values of log or from the sums of the values- 
of selected groups of the same function. ^ 

The above formula for Zj, preserves in a modified form the- 
principle of uniform seniority. hTot, however, fn a very 
practicable shape as in order to compute values of joint-lives 
(any number) we require tables of & joint-lives of equal age 
for various values of Jc, It is of course evident from general 
considerations that the force of mortality on any number of 
joint-lives must consist of two terms, each of which is 
a member of a geometrical progression, and that if we can 
find an age w -where the relative values of these two terms is- 
the same as in the joint-life status, the actual values will 
be the same when multiplied by some suitable constant 
The required joint-life annuity will then be represented by 
the annuity on h joint-lives all of age u*. 

Take as an example an annuity on the joint-lives of {x) 
and (y). Find h and w so that 

a- + a»'=fe«]^'hence «;= Iog. («" + a»') -Iog(5-- + 6^ ) 

I log u — log 6 

and ^ = (a"' + a") -^- 0 !“ = + &*') - 4 - },'»> 

Then it is obTions that if we replace x and y by x + t and y + t, 
h will remain unaltered and w will become w + t,so that the 
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principle of uniform seniority is maintained. Thus^ an 

annuity on x and y will be equal to an annuity on Ic lives all 

aged w ; oi% since k will not generally be integral^ it will be • 

more convenient to say tliat = cb\o where a' is calculated at 

forces of mortality always k times the normal force^ age for 

age. ThuSj we shall require tables for various standard 

values of k^ and we shall usually require a double interpolation^ 

since neither w nor k will usually be integral. 

The principle of employing the sum of two (or more) 
geometrical series to represent the logarithm of a function 
such as the number living may also be used with advantage^ 
as will be seen later on^ for census tables. (See the Sixth 
Lecture.) 

As an example of this formula^ we may apply it to the 
column of log in the 0^ Table. 

Taking the values of log for ages 20^ 37^ 54^ 71 and 88^ 
we have the following data : 

log ko = 4* 98432 = K + 

log Z37=4*94279 = K + Ma"7 4.N^^37 

^ log i,4 = 4*85300 = K + + 

log Zyi = 4'5808b = Iv 4* 

log Zsa = 3*47509 = K + Ma®® + N6SS 

whence differencing_, and writing 

we have 

M' + N' = logZ20“iogZ37= '04153 = A 
M'a+]Sr'/3=logZ37~logZ54= *08979 =B 
M'a2+N'/3^=log Z54-- log Z;i= *27214=0 
M'a®+N'y3®=log Z71— log Z88= 1*10577 = D 
whence^ noting that 

BD-C^ ^ AD-BC 


AC~B2 


a + /3 -j 
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we easily obtain : 

a=5‘l082 ; a=: llOO? 

/3=l-5243 ; 1-0251 

M'= -0073886 ; M= --00026403 

N'= -0341414; N= --039657 

Tiie following comparison of tlie values of l^s and 
decrements for quinquennial ages vnll indicate the approxi- 
mation of the formula to the 0^^ Table. 

Tjuble XL 

Values of lx cind of {Ix—lx+C) according to the 0^^ Table, as 
compared toith re-graduation by formula (2) . 



ij . 


Quinquennial Decrements 

" 1 

Age 

sy , I 

Formula , 

Original 

Bv 

! 

Original j 

Errors 


Value 

Formula 

Value 1 

+ 

- 

20 

i 96,453 

96,453 

2,129 

2,066" 

63 . 


25 

94,324 

94,387 

2,467 

2,445 

22 ‘ 

... i 

30 

: 91,857 

91,942 

2,896^ 

2,947 


51 ! 

35 

88,961 ; 

88,995 

3,443 

3,528 


85 ! 

40 

85,518 

85,467 

4,158 

4,205 


47 ' 

45 

81,360 

81,262 

5,108 

5,077 

k ! 

... 1 

50 

76,252 

76,185 

6,350 

6,266 

84 ‘ 

! 

55 

69,902 

69,919 

7,927 

7,846 

81 


; 60 

61,975 

62,073 

9,775 

9,766 

9 


65 

52,200 i 

52,307 

11,606 

; 11,692 


86 1 

70 

40,594 i 

40,615 

12,753 

I 12,863 


110 ; 

75 

27,841 i 

27,752 

12,192 

j 12,222 


30 i 

SO 

15,649 i 

15,530 

9,244 

t 9,171 

73 

i 

85 

6,405 : 

6,359 

4,827 

: 4,763 

1 64 : 


90 

1,578 

1,596 

1,406 

i 1,410 


"4 ■ 

95 

172 ; 

186 

167 

; 179 

! ... ( 

12 : 

: 100 

5 i 

7 

5 

! 
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FIFTH LECTUEE. 


Although ill tlie preceding Lecture the application of 
Makeham^s formula has been considered at some lengthy itfe 
importance is such that we may now touch on some further 
points^ and particularly on the application of the formula to 
the graduation of select tables. 

The suitability of Makehani^s formula to the graduation 
of mortality tables must be judged as we should judge the 
applicability of any other frequency curve to a given series of 
observations. That is to say^ we must consider whether the 
observed differences between the graduated and ungraduated 
values (the computed and actual deaths) fall within what 
may be properly considered to be the limits of error. 
For practical purposes^ owing to the great convenience 
attaching to the use of the formula^ it is worth while to 
stretch a point in its favour. Instead, therefore, of merely 
considering the closeness of the agreement between the 
actual and computed deaths, we may consider how nearly the 
ungraduated and graduated monetary functions, such as the 
values of premiums or annuities, are in agreement. If this 
agreenient is sufficient for our purpose, we are justified 
in adopting the graduation as given by the formula^ 
notwithstanding the fact that at certain groups of ages the 
divergences between the graduated and ungraduated deaths 
may be greater than would be expected from the theory of 
probabilities. In this connection it is to be noted that our 
observations relate to past time, and that the quantities we 
are measuring are all liable to change with time. Hence in a 
graduation intended to form the basis of tables of annuities or 
premiums it is- sufficient if the general character of the 
experience is retained without insisting too strongly upon a 
strict adherence to minor features. This is illustrated by the 
following table from Principles and Methods (p. 162), in 
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whicli we may anticipate for the moment the question of the 
application of Makeham^s formula to select tables : 


Qm Whole-Life Farticipating — Males, 
3 per-cent Premiums for £100 Assured, 



Prxi 1 

G - 

-u 








Sprague’s 

-( 3 ) 

Age 





Select 


Ungraduated 

Graduated 

+ 



+ 

( 1 ) 

(2) 

( 3 ) 

G ) 

( 6 ) 

(ti) 

( 7 ) 

20 

1-379 

1-365 


•014 

1*563 

•198 

25 

1*535 

1*551 

•016 


1-703 

•152 

30 

1-779 

1-785 

-006 


1*925 

‘140 

So 

2*086 

2*081 


•6o5 

2*218 

•137 

40 

2*453 

2*457 

•004 


2*602 

•145 

; 45 

2*952 

2*940 


*012 

3*106 

•166 

■ 50 

3*571 

3*564 


•007 

3*755 

•191 

55 

4*338 

4*377 

•039 


4*635 

*258 

60 

5*413 

0*446 

*033 


5*827 

•381 

65 

6*872 

6*854 


•ois 

7*433 

*579 

1 Average 

1 

3*238 

3*222 

-004 


3-477 

*235 


Here columns (4) and (5) show how far the graduated select 
annual premium for each age at entry^ differs^ from the 
ungraduated value for the same age, while column (7) shows 
how far the annual premiums deduced by Dr. Sprague from 
the data {Journal of the histitute of ActuarieSy vol. xxii, 
p, 391) differ from the premiums deduced from the 0^^^^ 
Experience. The average difference between the graduated 
and ungraduated premiums (irrespective of sign) amounts to 
•015 per £100 assured, a quite insignificant amount; whereas 
the difference between the premiums representing the earlier 
experience and those of the 0^^^ Table, representing the 
experience of 30 years later, are all positive and average ’235 
per £100 assured. 

Only a part of the differences shown in columns (4) and (5) 
are due to any systematic difference between the mortality 
as shown in the 0^^^ data and that assumed by the formula. 
Assuming, however, that the entire differences were due to 
this cause, it will be seen that the changes introduced into 
the values of the monetary functions by using Makeham^s 
formula are a very small percentage of the actual change 
that has occurred in the value of these functions during the 
course of 30 years. 
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Altliougli^ tlieref ore, the differences between the graduated 
and ungraduated deaths do at certain points somewhat 
exceed the limits of the errors of observation^, we are justified 
in using the graduated table as a standard for the future. 

Each case must^of course^be decided upon its own merits, 
and while the Experience and the 0^^ Experience have, 
with other tables, proved to be amenable to Makeham^s 
formula, the latter cannot be treated as a law of mortality 
to which all tables may be expected to conform. As already 
stated, its suitability must be tested, as that of any other 
frequency curve, but with rather more latitude owning to its 
practical advantages. In particular the formula is not 
generally suitable for tables representing the mortality of 
Eemale Lives. 


In the last lecture we considered various methods of 
determining the constants of Makeham^s formula for best 
representing a given mortality experience, in pai'ticular 
that depending upon the agreement between the totals of the 
graduated and ungraduated deaths and of their successive 
summations. We have so far, however, considered the force 
of mortality as a function of the age only, so that our results 
are applicable only to mixed tables of mortality, not to 
select tables in which the mortality is treated as a function 
both of the age of the life and of the duration of the 
assurance. 

The formula owes its value, beyond the incidental 
advantage that it gives us a very simple and effective 
method of graduation, to the relation it establishes between 
the value of an annuity upon joint lives of any age and that 
of an annuity upon the same ^lumber of joint lives of equal 
age. From the formula for the force , of mortality according 
to Makeham^s hypothesis 

yUj: = A + Bc^ 

it follows that the force of mortality for any number of joint 
lives, aged, for example, at entry Xy y, 2 , is given by the 
formula 

^ === BA + Bc^(c^ -h + C-) 


where 
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■where t represents the period elapsed since the date of entry. 
As a valne of v: satisfying this equation can always he found, 
and is independent of t, it follows that 

Ctxyz^ ^u'lL'vs 

It is seen that the relation subsisting between the Talue 
of ic and the values of x, y, z, involves the constant c only, 
and not the constants A and B; hence, any variation 
introduced into the values of the constants A and B, having 
reference to the time elapsed since selection and depending 
only on i, will not affect the relation between the age w 
and the ages x, y, and 2 . We can, therefore, write the 
force of mortality at age 02 + t for a life select at age x as 
follows : 

+ + . . . ( 1 ) 

and still retain the relation 

-r ? + ^[ 2 /i ? d" fl[x\ + 1 = + 1 

1 ^ 

when c’'” = - (c^ + + c^) . 

o 

f # 

Equation (1) may obviously be written in the form 

Atr,^^,=A, + B,c^+^ ...... (2) 

or alternatively, if, as is often moi'e convenient, we w^ork 'with 
the values of colog in the form 

colog = + (3) 

where A^ and B^, or at and /3^, may be any functions of f, but 
are not functions of cr. We can thus represent the rate of 
mortality as a function both of the age and of the time elapsed 
since selection and so approximate fairly to the rates of 
mortality shown in an ‘'analyzed"^ or ^"select"" mortality 
experience, wliile retaining most of the advantages arising 
from the use of Makeham^s formula. The two functions of t 
have probably a tendency to become constant as t increases 
but do not necessarily become so within any special period 
from the date of entry ; they may continue to change slowly 
throughout the whole duration of the table, and in theory, no 
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doubt^ should do so^ but for pra^ical purposes it is convenient 
to make them constant after a few years (say 5, or at most 10) 
from the date of entry^ beyond which point it is assumed that 
the effect of selection has worn off. 

If we set out separately the data for each year of assurance^ 
that iS;, for each value of t so far as we intend to trace selection^ 
we shall have a series of equations (corresponding to those 
shown on p. 65 for an aggregate table) for determining the 
numerical values of the functions /(O), /(l)^ &c., ^(0); <3S>(1), 
&c.^ the value of c being necessarily that determined for 
the ultimate table. In other words^ the data for each 
year of duration are treated as representing a mortality 
table complete in itself. We obtain in this way values for 
and or for at and for each value of so far as it is 
proposed to carry the select tables. Unless^ howevei'^ the 
experience is a very large one, these values will be very 
irregular. Indeed, in the case of the 0^^ data, which repre- 
sent a large experience, we have somewhat irregular values 
for ag and even during the first ten years of assurance, 
where the facts are most numerous. The apj^roximate values 
of at and ^jSt for the 0^^ data are given on p. 157 on Principles 
and Methods.''^ If these values are plotted out, the resulting- 
curves exhibit certain obvious characteristics, as will be 
seen by the diagrams opposite where the regular lines show 
the ungraduated, and curved lines graduated values of at 
and and the horizontal lines after 10 years represent the 
values for the experience of 10 yeaiV duration and upwards, 
when they are assumed to be constant. A period of 10 years 
would appear from the data to be the shortest within which 
we can effect anything like a smooth junction between the 
select and ultimate mortality rates. 

The values of at rise very rapidly in the first few years 
of assurance, but after about 6 or 7 years they appear to 
approach nearly their final value. In the case of ^ty however, 
w^e see that if the graduated cui*ve were drawn as closely as 
is consistent with smoothness through the ungraduated values, 
it would probably not reach the level of the ultimate value 
*0000466 until after 15 years from entry, and even then it 
would be below the value of jSt for durations of 15 years and 
over. Hence it would seem that the value of /3t does not 
become constant until about 20 years have elapsed from the 
date of- entry. We may almost say that while the effect of 
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selection as reflected in tlie values of the a constant disappears 
after about 7 years, the effect upon the values of /3 probably 
continues throughout the whole of life. The explanation is, 
no doubt, that the a or A constant represents mortality from 
accidental causes and from non-constitutional diseases of short 
duration, whereas the ^ or B constant represents mortality 
due to diseases of longer duration and to constitutional 
defects. 

Having obtained numeiucal values of and for 
successive values of it remains to represent these values 
by convenient formulae. The fact that the function does 
not reach its ultimate value at the end of 10 years from 
entry, involves either some sacrifice of the agreement between 
the adjusted and unadjusted values of this function, or a 
continuation of the analyzed mortality rates beyond the period 
of 10 years, which is not very convenient. In consequence of 
this fact we cannot apply the method of moments in fitting 
a graduated curve to these values. Where ,the fitting of a 
frequency curve involves any systematic departure from the 
original facts, the method of moments often gives 
unsatisfactory results, and a curve may be produced 
departing more widely from the obseiwations than if derived 
by a tentative method. 

In selecting formulae for graduating the rough values 
of at and ^ty there are certain conditions which should be 
fulfilled : 

1. A smooth junction between the curves representing the 
select and ultimate tables. 

2. An agreement between the graduated and ungraduated 
values of at, l3t in year 0, as a special importance attaches to 
the rate of mortality in the first year of assurance. 

3. An agreement between the aggregate graduated and 
ungraduated values of these functions during the period 
between the date of entry and the ultimate table. 

To conform to these conditions as far as possible, we 
must select a curve for the values of which, whilst 
running smoothly into the constant value at the end of ten 
years, will represent fairly well the distinctly lower values of 

in the years immediately preceding. This may be done by 
representing the difference between log (the value of this 
function in the ultimate table) and logl^xj+t (the value in 
the select table) so far as this difference is due to changes 



in /3o by an expression of the form 9i( 10 — where /? is 
the ultimate value; whence we have the corresponding 
difference : 

= 2?i(10-0i8c‘'" 

so that ^({ = [1 — 2?i(10 — 35 )c“^]/3. 

The result of this, is to eliminate from the /3 constant at 
the latter durations part of the effect of selection^ and 
somewhat to exaggerate the effect in earlier years. 

We have now to decide as to the curve best representing 
the values of at* The method employed will depend very 
much on the character of the experience we are treating. In 
the 0^^^^ Experience it was again found convenient to adopt 
an expression for the difference of logioZa;+ 2 j and logiolix^+ty so 
far as this difference was due to change in a^y containing a 
term similar to that due to with the addition of a further 
term repiiesenting a geometrical series rapidly diminishing as 
t increased. The final form of the equation for the 
Experien#e was asunder — 

logloZ[x•]+^=logloZ,+^— w(10 — 

Having determined the foi*m of this equation^, the simplest 
method for determining the constants is to express in terms 
of them the difference between the computed deaths by the 
ultimate table of mortality^ and the actual deaths for each age 
or each group of ages and each year of assurance. 

We have in that way a series of equations for determining 
the values of these constants m, on', o , n, and hence of 
A-t and for each value of t, similar in principle to the 
equations used for determining the values of the original 
constants A and B. The only point that arises is as to what 
particular way we are to group the observations to determine 
those values. 

The value of m in the above formula having been 
ascertained with a view to representing as nearly as 
practicable the effect of selection upon the constant there 
remain in all^ four unknown quantities in* the formula 
to be determined^ and the actual equations used to 
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determine them were formed hj taking the first and second 
summations by ages of the whole of these expressions, 
representing the difference between the select and 
ultimate rates, first for year of assurance 0 alone, and then 
for the whole of the ten years. . 

The selection of these particular groups is, of course, not 
a question of principle, but of convenience. Each case must 
be treated with reference to the nature of the curve of 
selection, as brought out by the statistics, and such a process 
adopted as appears to be calculated to bring out the best 
results in the particular case in question. 

It may happen in certain tables that it is inconvenient to 
trace out the effect of selection for so many years, and in 
particular this is the case in a table representing the mortality 
of annuitants. In such a table the effect of selection (which 
is here the self-selection of the annuitant) persists for a very 
long time. In a table of insured lives, owing to the cessation 
of new entrants in middle life, practically at about age 55, 
the mortality at the older ages is but slightly affected by 
selection. In the case of annuitants, where there is a 
constant inflow of fresh lives up to 75 or 80 yeass of age, 
the mortality is affected by this cause throughout the whole 
extent of the table. To completely repre^sent the" effect of 
selection in such an experience will require an elaborate series 
of tables, showing for each entry age the value of annuities 
for each year elapsed since entry for many years duration. 
The tables given in Principles and Methods pp. 124, 125, 
show that as regards the and 0"-^’ Experience, and doubtless 
the same feature would be found to be general, the values of 
the expectation of life ten years after entry are appreciably 
greater than the values for the same ages derived from the 
ultimate "" rate of mortality {eix}+io > e^+io) . Consequently^ if 
the graduated rates of mortality for the first five or ten years 
from entry are employed in conjunction with rates representing 
the aggregate mortality after five or ten years, as the case 
may be, the ultimate values of the annuities, and also the 
values of the date of entry will on the whole be under- 
estimated. In any table used for the grant of annuities 
it is, however, most important that annuities at the date of 
entry shall not be undervalued, and of only less importance 
that the valubs in succeeding years shall be such as may 
be safely employed in estimating reserves.; Any method. 
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therefore^ of treating an annuity experience which tends to 
underestimate the values of annuities is clearly unsuitable. 
Full weight must accordingly be given to the effect of 
selection^ but to avoid the heavy work involved in a complete 
analysis, the expedient may be adopted of computing a 
hypothetical table of mortality which will correspond to the 
values of the annuities, let us say, five years from the date 
of entry. If this can be done successfully and the rates of 
mortality for the first five years joined on smoothly^ with the 
rates in such hypothetical table, we shall then have a correct 
measure of the value of annuities at entry and for the five 
years following, while thereafter the values will be slightly, 
but not seriously, overestimated, an error which will be on 
the right side. 

We may take as our basis either the values of the 
expectation of life^^ or of the annuities at a suitable rate 
of interest. We will assume the former to be adopted. As 
these values (e[x]+5) will depend upon separate groups of data, 
viz., the entrants at individual ages, it will not be practicable 
to construct an ungraduated table of from the formula 

= ^ ^ tbe irregularities in the individual values of 

leading tew anomalous results. A better plan will be to 
graduate the table of expectations. For this purpose, we 
may assume any frequency curve which will represent these 
expectations satisfactorily, for example, a ouxwe such as 
logio^ar = a + . W e may employ values of 

deduced from the experience of individual ages at entry, 
or we may combine the entrants in quinary groups of ages, 
taking due account of the true average age of each group 
of entrants. 

The only point of impoi'tance where difficulty arises is the 
weighting of the different equations. These are not of equal 
weight because the expectations of life, as deduced from the 
unadjusted experience, are based upon a smaller or larger 
experience, as they fall at the extremes or in the .middle of 
the table, and some method jnust be devised for giving due 
weight to this fact. This may be done by simply weighting 
the equations with the actual number of entrants at that 
particular , age, and much may be said for this method 
although it slightly underestimates the weights at the 
extreme ages. If we are dealing, for example, with the 
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values of annuities^ and approximately the same result will 
be arrived at when working with the expectations of life, the 
plan of weighting the unadjusted values in proportion to the 
number of lives entering at each age, would make the total 
cost of all the annuities by the graduated table the same as 
by the ungraduated, an agreement that would have some 
practical value. In the alternative, we may consider that 
each value of the expectation of life (or of the annuity, as 
the case may be) should be weighted in proportion to the 
reciprocal of its average error. Thus if e(j.]+5= A + 2:, where 
A is the observed value and z the average error, we shall have 

-f-1. It is difficult to determine satisfactorily the 
z z — 

average error in the value of the unadjusted expectation of 
life,* the problem being complicated by the incompleteness 
of the observations due to the existing.^^ A fairly 
satisfactory method of estimating the average error would 
be as follows. Taking the series consisting of the values 
of 60? for all values of x, each of those values depending on 
a given age at entry only, we may' assume that the observed 
second differences of these quantities + which, 

in a well graduated table, would be very small, are due to 
the errors of observation in the values and In 

any particular group of entry ages, we may say that the 
average of the central second differences (taken irrespective 
of sign) will be, on the average, proportional to the average 
error in for that particular group.f Computing the average 
values of the central second differences (without sign), for 
various sections of the table, and drawing a smooth curve 
through them, we should obtain values from which suitable 
relative weights for the individual observations could be 
deduced. 

This would be a very fair method of determining practically 
the weight to be attached to the values of in different parts 
of the table. Or we may proceed, as was actually done in 
the case of the annuity experience graduation, by assuming 
the error in the value of 6^? to be a function, first, of the total 
number of deaths in the experience representing the particular 
entry age, and secondly, of the age x. This method may 
appear somewhat arbitrary, but as only the o'elative weights 

iX 

* See, however, the Sixth Lecture, pp.- 100-104. 
i*The average value of ejc.i--2eaj-hex+i will he VH times the average error in esc. 
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are in question, it is sufficient for tlie purpose. It must be 
understood that tlie relative weights adopted do not very 
greatly affect the results. The values of Makeham^s constants 
as deduced, for example, from the values of log for ages 
25, 45, 65, 85, thus giving equal weight to the observed 
value of mortality from ages 25 to 85, would not generally 
differ materially from the values resulting from a careful 
system of weighting, although, of course, the latter are to 
be preferred. 

Assuming the exposed to risk to remain unchanged, 
the average error in the observed number of deaths is 
approximately + *8 Vnq^[l'—q) where n is the total of the 
exposed to risk and nq the total deaths. The average 
percentage error in the total deaths will, therefore, be 


proportionate to + suppose that this average 

error is distributed uniformly through all ages passed through 
by the particular group of entrants, we can then arrive at a 
rough estimate of the average error in the observed value of 
Bx, by computing the effect of -a change of, say, 1 per-cent in 
the mortality rates throughout. 

The assumptions here are not strictly accurate, as errors 
in the val^^e of arise not only from the total number 
of deaths being greater or less than the expected amount, 
but from the manner in which the excess or defect of 
mortality is distributed through the table. The neglect of 
this second source of error will not, however, seriously affect 
the relative weights arrived at, and for practical purposes the 
relative average errors in the value of will be dependent, 
first, on the average error in the total deaths observed in the 
experience from which it is deduced, and second, on the 
extent to which a given percentage error in the mortality 
distributed uniformly through the table will affect the value 
of Bx* The product of these two factors may be taken as 
representing sufficiently approximately the expected error in 
the value of Bx, remembering always that this estimated error 
is not an absolute, but a relative measure at the various ages. 
When this is done, we have, by taking the reciprocals of 
those quantities, the weights which we shall give to the 
observed values of Bx in order to determine our constants. 

It is necessary to point out that this process, while suitable 
for expectations calculated from entrants at a particular age 
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or small groups of ages^ will not apply to aggregate tables ; 
for in their case the percentage error in the total deaths 
above age oc steadily increases as x increases, so that this 
method would produce weights steadily diminishing from the 
youngest age to the oldest, which would obviously be 
incorrect. 


Notwithstanding the important effect of selection on 
mortality, it is frequently ignored, as in the and 0^^ 
Tables. It is important to consider, therefore, what is the 
net effect in a mortality table of neglecting altogether the 
factor of selection. Considerable additional labour attaches 
to the use of select tables for valuation purposes, and the 
question may be asked what kind of errors do we make if we 
neglect the fact that mortality is a function not only of the 
age, but also of the duration of assurance, and treat it simply 
as a function of the age as it is treated in the 0^^ and 
Tables. In a mortality table representing assured lives the 
effect will be seen if we compare a table like the Table 
with a table like Dr. Sprague^s Select Table, or if '^’^e compare 
a table such as the 0^ Table with a table like the 
Select Table : o ^ 


Comparison of Annual IBremiums for the Assurance of 100 
(3 per-cent interest.') 


Age 


urn 

Sprague 

OM 

0 [M] 

20 

1-427 

1-563 

1-306 

1-365 

25 

1-625 

1-703 

1-524 

1-551 

30 

1-880 

1-925 

1-790 

1-785 

35 

2-193 

2-218 

2-116 

2-081 

40 

2-589 

2-602 

2-524 

2-457 

45 

3-114 

3-106 

3-046 

2-940 

50 

3-801 

3-755 

3-730 

3-564 

55 

4-725 

4-635 

4-641 

4-377 

60 

5-987 

5-827 

5-872 

5-444 

65 

7-705 

7*433 

7-557 

6-853 


If we compare, as is most convenient, either annuity or 
premium-values, we shall find that the effect of ignoring the 
element of selection and treating the mortality rates as 
a function of the age alone is that, at the younger entry 
ages, premiums are underestimated and annuity-values are 
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overestimated * The 0^^^^ premiums should^ properly speakings 
be compared with those derived from a table representing the 
true aggregate of the select tables^ but no such table is avail- 
able. There is a point, which is in general somewhat greater 
than the average age at entry, at which the two curves 
representing the premium values for the mixed and select 
data cross each other, and for the older ages the premiums by 
mixed tables are greater than those by the select table. The 
extent of the differences in the premiums is sufficient to 
render it necessary, in adopting a basis for assurance 
premiums, to take into account the question of selection. The 
only plan by which the use of select tables can safely be 
avoided, is either by adopting a special form of loading 
or by throwing out altogether from the data upon which the 
premiums are based those years of assurance which are 
seriously affected by selection, that is to say by employing 
a table of the or type. We then obtain a table 

which at all ages overestimates the values of the premiums 
and underestimates the values of annuities. 

A table representing ultimate rates of mortality, that 
is, of the QM(5) type, is therefore a safe one to employ 

for the grant of assurances, although not for the grant of 
annuities.^ There i?, indeed, very much to be said for the use 
of a table of that kind for assurance purposes, but, to 
discuss that question, we should have to go into the finance of 
life assurance valuations, which hardly comes within the 
scope of our subject. 

With a view of avoiding the necessity for select tables, a 
device was adopted by the American offices in their first 
experience denominated the final series method. Tlie 
object was to produce a table not entirely unaffected by 
selection, but in which its influence would be reduced to a 
minimum ; a table of mortality similar to that which might be 
supposed to prevail in an office of great age doing a uniform 
and steady new business. To produce that result the lives 

^ This is shown, in the table above, to be the case both with the and 0"^^ 
Tables. Unfortunately, however, in neither case is the comparison very 
satisfactory. Dr. Sprague’s premiums from the method of their calculation 
are probably somewhat higher than the true values, and in the case of the 
0^^ Table we are comparing select jjremiuins based in part upon the aggregate of 
the select tables, excluding first ten years from entry, with O^^^premiums based 
upon an aggregate table from which there had been a further elimination 
of duplicate assui-ances. 

G 2 


84 


existing at the close of the observations were traced out 
through a hypothetical future in which they w^ere assumed to 
he subject to rates of mortality and lapse identical with 
the rates actually observed in the past among lives of similar 
age and duration. The minor details of the process we may 
pass over. The result from a financial point of view is that 
the premiums are still underestimated for the younger 
insuring ages^ although not to the same extent as in a table of 
the type^ and are overestimated at the older ages^ the 
point at winch the values cross the true curve being earlier 
than would have been the case had the final series 
adjustment not been used. There are some practical 
difficulties in adopting a method of this kind. One of these 
is that after some 15 or 20 years^ duration the observed rates 
of mortality for individual ages and years of assurance 
depend on a very few facts. We then have to apply 
the very irregular rates resulting from those few facts 
to much larger numbers^ including the existing lives that 
have been brought back hypothetically under observation; 
so that where these irregularities become inconveniently 
large^ the application of the method must ceasn; or else 
these irregular rates must be subjected to some process of 
graduation before being used in the calculations. 

This difficulty could be met by using a species of 
QM(i5) Qj. QM( 20 ) fQP risks of 15 or 20 years^ duration and 

upwards^ instead of the rates of moi'tality deduced from 
individual years of assurance. There are^ however^ other 
objections to this method as an expedient for counteracting 
the effect of a too short average duration of assurance. 

As the rate of mortality amongst assured lives cannot 
strictly be treated as a function of age alone^ but is also 
dependent upon the duration of assurance^ so the rates of 
sickness in a Friendly Society^ or of re-marriage in a 
Widow^s Fund^ are affected^ respectively, by the duration 
of membership, or of wddowhood. Sufficiently approximate 
results may, however, be generally arrived at in these cases 
by treating the rate of sickness, or of re-marriage, as a 
function of the age alone: in the former case because 
the effect of selection is not very great and is soon exhausted, 
in the latter case because the average constitution, as regards 
the duration of widowhood, of a group of lives passing under 
observation at a given age will be found to remain fairly 



constant (unless tlie Pension Fund is of recent establisliinent) 
and the financial effect of a marriage when it occurs is a 
function of the age only. 

Where, however, we are dealing with rates of dis- 
continuance or lapse, it is important that these should be 
analyzed both as respects age and duration. Owing to the 
fact that the financial effect of a discontinuance is mainly 
dependent upon the duration of assurance, very erroneous 
conclusions may be deduced by treating the rates as functions 
of the age alone as has sometimes been done. If this course 
is adopted special precautions must be taken, such, for example, 
as deducing the rates from a body of lives representing the 
existing some 10 or 20 years back, and excluding from the 
exposed to risk all more recent entrants, as proposed by 
Mr. A. W. Watson {J.I.A., xxxv, 313-4). 


SIXTH LECTURE. 


IjST the concluding Lecture we shall deal with some 
miscellaneous points of general interest or arising out of the 
previous Lectures. We have already dealt with the nature 
of the modifications of Makeham^s formula for the force of 
mortality, necessary to enable us to represent satisfactorily 
the mortality shown by select tables such as the 
These modifications consisted in treating the quantities A 
and B or a and /3 in the formulas 

= xi + B . ; colog = a 4- /S . 

which are constants as regards the variabl^e x, as f;|^nctions of 
t the time elapsed since the date of selection. 

It is clear that a similar course may be pursued if any 
other formula than Makeham^s is employed in the graduation 
of the ultimate table. Thus we may write 

where At and will in general be such functions that, as t 
reaches a certain value, at which the select and ultimate 
mortality rates merge, At becomes zero and Bt nnity. The 
form of these expressions employed for representing the 
effect of selection suggests that a similar form may be 
employed for representing rate of discontinuance, which in 
general may be taken to be a function of the duration of 
assurance and of the age at entry. The same remark applies 
to such a function as the rate of remarriage amongst widows, 
which is, similarly, a function of the duration of widowhood 
and of the age. 
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Altliougli we have dealt at considerable length with the 
nse of Makeham^s formula in connection with mortality 
tables^ there are some further remarks to be made as to its 
employment in certain special cases, more particularly in 
connection with the age statistics at a Census. 

If we suppose a population which is (1) subject to uniform 
rates of mortality, corresponding at the adult ages to 
llakeham^s formula, (2) such that the numbers living 
represent the survivors from a number of births increasing 
annually in a geometrical progression, and (3) is subject 
to a rate of emigration or immigration uniform at all 
ages, then if represent the numbers in the population, 
at a given moment of time, passing through the exact age 
obviously the curve of will follow Makeham^s formula, 
and if we write 




A. 

dcV 




i'. 


:(A + r)+B. 


we shall have a formula similar to the usual formula for the 
force of :qiortality^ but -with the constant A increased by r, 
the rate per annum at which the population is increasing; 
that is to say, the natural rate of increase less the rate 
of emigration. It is true that hardly any population 
will be found to conform very closely to the above 
assumptions, but nevertheless it will be frequently found 
that the population curve for the adult ages does conform 
to Makeham^s formula for Z^., although in most cases it 
will be necessary to adopt Makeham^s second development 
of Gompertz, with the additional constant in the expression 
for jjLx^ 

If the population is given, as is usual, for decennial age 
groups {e.g.y 15-25, 25-35, 35-45, &c.), the values of the 
ordinate for the middle age of each group may be 
obtained with sufficient approximation by deducting from 
each term Ux of the series representing the numbers in 
successive age groups one twenty-fourth of the central 
second difference 


V 24 ; 
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From the values o£ thus obtained, by writing 

log Vcs='K + s.x+g.c^, 

or, logV^='K + s.x + h.x^+g 

as the case may be, the constants may be determined as for a 
mortality table. 

Take, for example, the male population of England and 
Wales, enumerated at the Census of 1901, as under : — 


Table XII. 


Male Population in Age-groups : England and Wales, 1901. 


Age 

Group 

Xumbers^ 

Central 

Ordinate 

A -a;-! 

log (3) 

Alog(3) 

Anog(3) 

a 3 log (3) 

Col. (4) 
Adjusted 







(1) 

(2) 

(3) 

(4) 

(5) 

(6) 

(7) 

(8) 

15-25 
25-35 
3o— 4o 
45-55 
55-65 
65-75 
75-85 
85-and 
over 

94,693 

76,425 

59,394 

42,924 

27,913 

14,691 

5,080 

552 

76,373 

59,371 

42,863 

27,838 

14,541 

4,868 

4-8829 

4-7736 

4-6321 

4-4446 

4-1626 

3-6874 

-‘1093 

--1415 

-•1875 

-•2820 

-•4752 

-•0322 

-•0460 

-•0945 

-•1932 

ft 

-•0138 

-•04l?i5 

-•0987 

c 

4-88349 

4-77301 

4-63269 

4-44401 

4-16319 

3*68681 


*To reduce the magnitude of these numbers, the figures used are those 
corresponding to a total population (M & F) of 1,000,000 as given in the Census 
Report. This, of course, does not affect their relative value nor the form of the 
curve. 


Here^ eyidently^ Col. (6) cannot be -vvell represented by a 
Geometrical Pi’ogression^ but with Col. (7) this is possible 
without very serious changes in the values. This would give 
a formula corresponding to Makeham^s second modification of 
GompertZ; viz., 

log I X— P- "b -Aej/ -f* A. . “I” !B . 

for the values of the logs of the numbers living at age x, 
given in Col. (4). As these numbers are only approximate, 
and our object is merely to show the applicability of the 
formula as a base line, we may adopt a very simple method 
of determining the constants, similar to that used by 
Mr. Makeham in his paper on the Law of Mortality {J.I.A., 
xiii, p. 338 et seq.). If the terms in Coh (4) are alternately 
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diminislied and increased by a quantity the quantities in 
Col. 7 will become 

— •0138 + 82; 

— *0485 — 82 ; 


-•09 87 + 82 ; 


These terms can obviously be made to form a geometrical 
progression by suitably determining 2 ;, and their common 
ratio^ found by dividing the sum of the second and third 
terms by the sum of the first and second^ will be equal to 


1472 

t)23 


2-363. 


Dividing the sum of the first two terms by 3*363 we get 
0623 

— *01853 as the adjusted first term^ giving 


3*363 


82 ;= —47*3 and 2 ;=— 5*9. Hence the transformed series for 
Col. (4) is as shown in Col. ( 8 )^ where the progression 
accurately follows Makeham^s second development. 


It is on the whole more convenient to deal with the 
numbers ^living above age x rather than the numbers for 
the decennial age groups. 

•If we treat t*lie numbers in Table XII in this manner, 
representing the numbers living above age x by the 
expression 

log Q^, = K + 

we shall have the results set out in the following table, where 
the values of the constants have been determined by ignoring 
the extreme values of log at ages 15 and 85, and equating 
the sums of the values of the above expression to the values 
of (logQaa + logQas); (log Qss + log Q 45 ), &c., by which means 
we obtain for the values of the constants 

log a= *006420 {mor^) = — 1*0582 
log & =*035184 -007933 

K= 6*4222 

The five figure logarithms of were employed in the 
calculation, but, owing to the nature of the process, the fifth 
figure in the graduated column cannot then be relied upon ; 
the logs have therefore been throughout cut^ down to four 
figures in the table, which is quite sufficient for the purpose 
of illustration. 
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Table XIII. 


MaJe Population living alove the undermentioned ages. — JEngland 
and Wales, 1901. 

(Based upon figures in preceding Table i) 


Age 

i Proportional | 

; Numbers | 

log Qa: 

Alog Qx 

log Q!x 

j 

Alog Q'x 

log Q'x- 

-logQi 

— 



X 

! Qx 

' 1 


K + + nh^ 


+ 

- 

. 15 

321,672 

5-5074 

i - *1514 

5*5059 

1 

-•1498 


•0015 

i 25 

226,979 

5-3560 

i 

: - -1783 1 

5-3561 

--1785 

•0001 


i 35 

' 150,554 : 

5*1777 

! 1 

j - *2179 

5*1776 

-•2177 


•0001 

! 45 

i 91,160 ' 

4-9598 

! - -2764 

4*9599 

-•2766 

•0001 


1 55 

1 

i 43,236 

4*6834 

! 

i - -3754 

4-6833 

-•3753 


•0001 

! 65 

i 

20,323 : 

i ' 

4*3080 

- *5573 

4-3080 

-•5574 


i 

75 

; 5,632 ■ 

3-7507 

i 

! -1-0088 

3-7506 

-•9218 


•0001 

85 

1 

: 552 1 

2*7419 

! 

j 2-8288 


•0869 



The practical identity of the carves at all ages^'except 15 
and 85^ which values were not used in determining the 
constants^ suggests that very accurate results might be 
obtained by making use of a curve of the above form for 
interpolation of intermediate values of 

It has been proposed to employ Makeham^s formula to 
represent the curve of sickness rates at successive ages^ and 
this has been done with a certain degree of success^ but the 
practical advantages of the formula as applied to sickness 
rates are not very apparent, as it is usually necessary to 
know not merely the total sickness rate at each age but its 
division into sickness of various durations, as the number of 
weeks per annum during the first six months of illness, from 
the sixth to the twelfth month, after the twelfth month, 
&c. As Makeham shows {J.LA. xvi, 414), the ratio 
Weeks sickness experienced in the year of age 
Exposed to risk in middle of year of age 
is not a function similar to px but to since it has a definite 
limit, namely, 52, or 1 if the sickness is expressed in years in 
lieu of weeks.'’ Hence if we represent the above ratio by the 
symbol we should write 

log (52 — =A + B.c^. 
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Where^ by the constitution of a society there is no formal 
superannuation^ the sickness benefit continuing throughout 
life; it is almost invariably the practice of actuaries in using 
Sickness Tables for the purpose of computing contributions 
or valuing benefits to assume that the so-called sickness 
will become chronic after a certain age^ 70; 75; or 80. In 
such caseS; as the rates of sickness actually employed will 
generally be much below the maximum of 52 weekS; we may 
use log (N—Sa:)=A.-\- Bc^’ . The value of N must be determined 
by trial. 

Mr. King has given an example in the graduation of the 
values in the Text-book mortality table at the youngest ages 
of a fux’ther application of Makeham\s formula; the term 
in the expression for the force of Mortality representing; of 
course; equally well an increasing rate of mortality as in adult 
life or a diminishing rate as in infancy and childhood. 

In the common case of an asymmetrical series the terms 
of which become zerO; or very nearly so, at each end; the 
following method of employing the ^‘norrnaK^ frequency 
curve to ijjepresent the series will often be found convenient 
and effective; particularly if the data are presented in the 
form of % few groups. Let the successive ordinates of the 
curve be represented by the equation yz=:f(iv ) ; we shall 
assume the total area of the curve to be unity and the area of 
curve between the limits A' =00 and x=zt will be Let 

us write 

J£ V TtJ -00 

so that Yo = l=-i=-[ 

where ^ is a function of t, the form of which is to be 
determined by the data. For most purposes it will be 
sufficient to treat 2 as a parabolic function of but it will bo 
seen later that there are certain cases in which a different 
hypothesis as to the form of the function ^ is to be preferred. 

An example will make plain the method of proceeding. 
Take the 0 ^^ data as summarized on p. viii of the volume 
of Unadjusted Data (Whole-life; Males). In the last two 
columns of the table there is given the ‘^proportionate 
distribution per-cent '' of the exposed to risk and died. 
Taking the figures there given we obtain the following tables. 
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The values of z are found by entering a table of -^J 

for + and — arguments with the values in the second 
columns of Tables (XIV) and (XV). We may employ a table 
such as that given by Woolhouse {J.I.A., vol. xvii, p. 50) or 
that given on pages 138, 139, at the end of these lectures. Note, 
however, that in each of these tables the function tabulated 

O 

js say I,,, for + arguments only, so that the total 

0 

area of the curve from — oo to +oo is 2 instead of 1. Hence, 
if Yi is > i we must put 


Y,= ^-+K=-^-- 




y7rj . 


+ 


1 2 


2 V it] i 




dt 




so that 2 takes the value corresponding to the tabular value 
I^=2 Yj 5— 1. Similarly, if Y< is we put negative and 
numerically equal to the argument, giving 1^=1 — ^Y^!. 


Table XIV. 


0^^ Dai a. Exposed to DisJc, 


Age 

t 

Proportion 
Exposed, to Risk 
above age t 

1 

= — •/ e-t-dt 

'V'JT ^ -X 

Values 

of 

2 

Az 

A"z 

A^;r 

A-b 

A*S 

0 

10 

20 

30 

40 

50 

60 

70 

80 

90 

I’OOOOO 

•99991 

•99584 

•90060 

•65989 

•39810 

*18795 

*05951 

•00927 

*00039 

00 * 
2*6500 
1-8660 
*9086 
*2915 
- *1826 
- -6261 
-M023 
-1*6650 
-2*3750 

-•7840 

i-*9574 

-•6171 

l-*4741 

-*4435 

-•4762 

-•5627 

-*7100 

*3403 
•1430 
' *0306 
-•0327 
-•0865 
-*1473 

-*1973 

-•1124 

-•0633 

-•0538 

-•0608 

*0849 

•0191 

•0095 

-•0070 

-•0358 

-■0396 

-•0165 


* Theoretically the values of z corresponding* to a total frequency of 1 and 0 
are respectively d=co. As however 2 = ±3 corresponds to y = ’999989 or 
•000011, ±3*5 to Y- *99999963 or *00000037, and 2 = ±4 to Y- *999999992 

or *000000008, it will be seen that any value of z over 3 will sufficiently represent 
' distribution or the zero value, and in practice it would be quite 

sufficient to insert at the ends of the table any convenient value of z over 3, 
- and consistent with the general run of the intervening terms. 


Table XV. 


0^^ Data. Deaths. 


Age 

t 

Proportion of 
.Deaths 
above age t 

Vfl- «/ - 00 

Values 

of 

Mean 
Error of 

in last 
place 
of 

decimals 

Az 

A-z 


f> 

> 

it' 



0 

10 

20 

30 

40 

50 

60 

70 

80 

90 

1-00000 

1-00000 

•99925 

•97565 

•88854 

•74174 

•53731 

•28908 

•03169 

•00590 

oo * 

00 * 
2-2450 
1-3939 
•8618 
•4587 
•0663 

- -3932 

- -9856 
-1*7806 

il92 
i 48 
i 28 
d= 24 
i 23 
i 24 
i 32 
i 82 

-•8511 
-•5321 
-•4031 
-•3924 
j — -4595 
1--5924 
|-•7950 

•3190 

•1290 

•0107 

-•0671 

-1329 

-•2026 

-•1900 

-•1183 

-•0778 

-•0658 

-•0697 

•oln' -'0312 
-•0285 : 
-COM -0159 i 

i 

i 

j 1 


* See note at foot of Table XIV on preceding: page. It is to be noted, that in 
lien of the integral of the normal frequency function, the function e«/(l + e®) 
may be used, leading to a metliod of procedure similar to that referred to 
on p. 51. 


The colanm containing the mean error or standard 
deviation^of s in the table of deaths is computed as follows. 
If the total of the series (in this case the total deaths) is n, 
and the total above a given point (in this case the number of 
deaths above age t) is w, then the mean error in m is equal 


to 


. From this can be calculated the mean 


of 


errors of the values in column (2). The change in the value 
z corresponding to a given change in the values of 

-j' e-^-dt in column (2) being known from the table of this 

function we obtain the values in column (4) . These standard 
deviations are not inserted in the table of Exposed to Ris ^as 
the principle upon which the mean errors in the proportionate 
distribution of the deaths are computed is not strictly 
applicable to the table of Exposed to Eisk^ when the a ei 
represent observations spread over a long and continuous 
period, although it would be applicable if the numbers dealt 
with represented the exposures in a single calendar ^ar. 

If we examine the columns of the succesave erences 
of a in the two tables, ignoring the infinite values ot z 
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corresponding to a total distribution of unity we shall see 
that they exhibit a remarkable similarity in the nature of 
their progression, especially from the colnmns onwards. 
It will also be apparent that a very small alteration of the 
original values of % in either table would be needed to make 
the fifth differences constant ; that is, -we may assume without 
serious error that 

z=.a-\-ot-\-c, — - +&C. 

In order to obtain the closest agreement with the 
original facts due regard would have to be taken of the 
weights corresponding to the mean errors in the value of 
as given in the table. But we shall obtain results quite 
good enough for all purposes by the following simple 
procedure. It will be observed from the values of the 
mean errors that the values of 2: for ages 40 to 70 have 
approximately the same weight, those for ages 30 and 80 
have somewhat less weight and finally those for ages 20 
and 90 much less. 

If we combine the values of 2 in sets, thus, 

r 

% + 3^30 -i- Z4Q ; ZsQ + 3^40 + 2^30 ; &c., 

with their corresponding numerical values we shall obtain 
six equations to determine the six coefficients, a, 5, c, . . ./. 
Into these equations the values Zoo and 2:90 will enter once, 
the values 2:39 and Zso four times, and the remaining values 
five times. We need not compute the numerical values 
for all these equations as it will be evident that if we 
write them down and difference them we shall arrive at 
the following: 

5a + 5b + G= ^20 d" 3^30 + ^40 = 7’2885 
5h + fic-j- cZ= A (2^20 d" 3^30 + 2/4 o) = — 2*8505 
5cd"5dd"e== A^(22od’32J3()d“2;4()) == *7167 

5d + 5e d-/ = A^( z 2 o d- 3230 d- 2;4o) = - * 6227 
5e-h5J^ =A'^(22od“ 3^30 d" 2:40) = *2052 
5/' = A^ (220 d“ 8230 d” 2:40) = — ‘1 326 

From these equations the values of /, e, cZ, &c., can be 
obtained 'with great facility. 
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Having obtained a formula for 2 in terms of t, we can now 
obtain any term in tlie series and can also obtain the 
value of y, the ordinate representing the number of deaths at 
age X (i.e.; approximately between ages x—h and since 

1 dz 

'ill 

and % I 4 A-.^,+ | A^r,. 

It will generally be sufHcient to compute the values of y 
for decennial or at most quinquennial intervals and to 
interpolate the resulting values of yx or for the inter- 
mediate ages. 

The values of the quantities cq &c., satisfying the 
above equations, are 


a = 

2-24374 


•186796 


- -849365 

<3 = 

•067560 

c = 

•316624 

/= ~ 

•026520 


It may be of interest to give the adjusted values of and 
the distribution of deaths corresponding to these which are as 
under : 

Table XVI. 

0^^ Data. Deaths. 


Adjusted values of z and adjusted distrihuiion of Deaths. 


Age 

- 

- \ f (t • 

Last column mor«.‘. ( + ) or 
less ( — ) tliau corresponding 
column ill Table (X.V). 

0 

6*136-15 

1-00000 


- 

10 

3-69061 

I'OOOOO 



20 

2*243 / 4 

*99925 



30 1 

1-39438 

■97509 

*00004 


40 

•861()() 

•888-48 


■C166OG 

50 

‘45872 

'74174 

*00600 


60 

*06640 

*53740 

-0000!) 


70 

- -39352 

*28893 


•06015 

80 

~ -98473 

*08188 

•06619 


90 

-1*78289 

'00585 


•00005 

100 

-2*90220 

'00002 

... • 
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The principal objection to this adjustment^ paradoxical as it 
may sound, is that it too closely follows the original facts, the 
deviations being very much smaller than the probable errors 
of the observations. This is, of course, due to the fact that 
we have included too many constants in our formula. A 
constant fourth difference in the values of however, may 
lead to anomalous results, and a constant third difference 
makes the errors of adjustment too great. The best plan in 
such a case would be to adjust the exposures by- using a 
constant third difference, to recompute the deaths to 
correspond to the adjusted exposures in the 10 year groups 
and then employ a constant third difference for the graduation 
of the death curve. Or, as an alternative, an expression for 
z may be assumed of the form 

7 , 

= — 

a-\-x o + x 

and the values of h, m, n, a, &, determined by weighting the 
equations in a manner similar to that shown above for the 
fifth difference curve. 

We have used the 0^^ data to illustrate the above process, 
but generally speaking the latter will be found more useful 
where the data are only available in large groups, and, in 
particular, where the limits of the series are not well defined. 

In the following table we have a statement taken from 
Supplement to the Kegistrar-GeneraTs 45th Annual Keport, 
p. cxviii, showing the number of Innkeepers, &c., living at 
or over certain given ages. 

Table XVII. 


InnJoeepers, Fublicans^ ^c. (1881). 


Ages 

t 

Living 
above age 
t 

Proportional 

numbers 

= 4 - 

Values of 

15 

232,890 

1-0000 

00 

20 

230,280 

•9888 

1*6147 

25 

222,213 

•9542 

1*1929 

45 

105,153 

•4515 

- *0862 

65 

14,451 

*0620 

-1*0877 


It will be seen that more than 50 per-cent of the numbers 
living are in the age-group 25-45, and nearly 40 per-cent in 


I 


97 


the gTOup 45-65. In such a series the usual methods of 
interpolation would probably give unsatisfactory results. 

If we treat the values oE 2 as having constant third 
differences^ we obtain the following equations, taking five 
years of age as the unit — 

a =1*6147 

a + h .. =1-1929 


a 5J^ + 10c + 10(£= - -0862 

a-f-9& -{-86cH-84ti= — 1*0877 

hy Cy and d are the values, reckoning from age 20, of the 
■differences of 2 . The values of a and h are given immediately 
.and solving the remaining equations for c and d we obtain — 

u= 1-6147 c=-04863 

4218 d= -*00782 

which enable us to form at once the following series of 
■quinqueniAal age groups. 


^ ^ Table XVIII. 


Junlceepers^ JPublicaiis^.^c, (1881 Census), 


Age 

t 

Interpolated 
Values of 

Corresponding 
Values of 

- 

00 

* 

Proportional 
Population between 
Ago 

t and (?^ + 5) 

15 

2*0930 

•9985 

97 

20 

1-6147 

•9888 

346 

25 

1*1929 

•9542 

774 

30 

•8197 

•8768 

1221 

35 

■4874. 

• 754.7 

1499 

40 

•1880 

•6048 

1533 

45 

- *0862 

•4515 

1376 

50 

- *3429 

•3139 

1119 

65 . 

- *5904 

*2020 

834 

60 

- *8360 

•1186 

566 ^ 

65 

-1*0876 

*0620 

342 ^ 

70 

-1*3534 

•0278 

176*7 i 

75 

-1*6407 

“ •01017 

73*5 ■ ■; 

80 

1 -1*9577 

•00281 

22*5 ■ 1 

85 ' 

- 2*3121 

•00053 

4*7 j 

90 

1 -2*7117 

*00006 

■' -6 1 

.... 1 


# ^Representing tlie popnlatiou living above age -a; oiit of, a total population af 1. 


H 
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It will be seen that this distribution shows a small number 
of cases below age 15. This may be avoided if it is desired 
to commence the curve at and not before that age^ by 
writing 

= 7 — 4* <36 +• + ci“ 


giving 


^=b + 2ct 

at 


m 


the term Toeing introduced in order to give the high 

t — ‘ Xo 

values of 2 : required near the origin, or, we may write as 
suggested above, in connection with the 0^*^ data, 

, m , n 

h T— 

a -{-X b-\-x 


the value of a being taken in this case as equal to —15. 

This form for the value of ^ will be found very convenient 
where the series is known to be limited in either direction and 
the number of groups is small. In certain cg^ses either 
a or 6 may be known, and we have, then, only four constants, 
m, n, h, and h or a, to determine, for which four groups will 
suffice. Or it may be convenient to assume values for both 
a and 5, in which case with four groups we may write 


_ m n 
t + a~^ t-^ b 


-j-Jc-i-cty 


determining m, 71 , Jc and c from the data. 


In the case of any statistics intended to he used by the 
actuary, it is important to consider not only how far they are 
suitable for the purpose for which they are to be employed, 
but also whether the data are sufficient to render the 
conclusions drawn from them safe. We have already referred 
to this question in general terms, but it is necessary tO' 
consider it rather more closely. 

In practice the actuary has to deal either, (1), with tables, 
based upon a large number of observations; for example,, 
tables such -as the 0^, the Grovernment Annuitants, the 
Manchester Unity Tables of Sickness, &c., where the 
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accidental errors due to the limited numbers are practically 
insignificant, but irYhere, on the other hand, there may be 
uncertainty as to the suitability of the experience for the 
case in hand; or, (2), with data of more limited extent but 
known to be applicable, as in the valuation of a pension 
fund of a Friendly Society by tables based upon its own 
experience. 

In the latter case it is important to be able to form some 
judgment as to the extent of the probable errors involved in 
the use of the data and their effect upon the financial values 
deduced therefrom. This is a problem not susceptible of an 
exact solution. It is true that if the series of numbers 
representing the deaths, marriages, or retirements, as the 
case may be, can be represented by a frequency curve, the 
probable error of the constants may be obtained in the manner 
shown by Professor Karl Pearson in his paper on this subject. 
But these results will be little practical use to us, as- 
the manner in which these probable errors, which are not 
independent, will affect the monetary values deduced from 
the graduated rates is too complicated. We can only deal 
with the problem in a very general manner. We are not 
even sure that the ordinary theory of errors is applicable to- 
such functions as fates of mortality, sickness, or superan- 
nuation ; indeed, we may well suspect that it is not strictly 
applicable. 

If the probability of throwing head at a single toss of a coin 
is one-half, and if in 100 throws 54 heads appear to 46 tails, we 
do not suppose that the probability of the average number of 
50 heads appearing in the next 100 throws is affected. But 
in the case of the probabilities of death it may well be that 
an abnormally high or low rate of mortality in a given year 
may affect the probable rate in succeeding years, and that 
there may be a tendency for the deviations from the average 
result to correct themselves, a low rate in a given year 
leaving a larger number, and a high rate a smaller number, 
of impaired lives surviving, and thus changing for the time 
being the constitution of the group under observation. 

The standard deviation in the value of as deduced 
from a given experience has not, that I am aware of, been 
estimated. It will be instructive to attempt this, as an 
example, for the table. It will be sufficient to use 

approximate methods, as the results will be quite accurate 

H 2 
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■eiiougli for our purpose. We shall assume that we may take 


cologeP=W'== — ~— 

and that if the standard deviation in log y=icrj then the 
standard deviation in ^ = cr?/.'^ 

Taking the observations at a given age let us put 

•exposed to risk =n 

graduated or true rate of mortality = g'. 
graduated deaths —nq =0 

actual deaths =nq + 2 = 0' 


■observed value of q 


= 5 


— } 

n 


wherOj as we have seen, the average value of z is zero, the 
average value of z^=^nq[l — q)y &c. {see p. 110). 

Then the observed value of m=m' where 


m 


__ iiq'\-z 


'71 — 


2 


+ (terms in powers of z) 


=m+/(z), say 
= colog«j)+/(z) 

It will be found that the average value oif{z) is not quite 


(j2n 47i‘^) 


zero though very nearly so, being equal to m 
nearly, a quantity that may be neglected; and that the' 

. 7)1^ 

average value of [f{z)y is — very nearly, and 

71/q 

v'^BjYevQ^ge Yolue oi [f{z)y=: 

Hence, the standard deviation in the central death (or 
marriage or secession or. any similar) rate is very nearly equal 
to the rate divided by the square root of the number of deaths 
(marriages or secessions, &c.). The errors in log^p are of 

* If logey liave tlie small error cr, y ’will be changed to e^oge2/+cr_,y go- 
*=y(l + cr+ . . .), i.e., the corresponding error in y will he ay nearly. 
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course the same, but of opposite sign to those in colog 
Let the observed value of logePx be loge^J^'a-. We will write 

'^Ogep'o^ = logeP^c + '^('0^ 

where is the error of observation whose value in a particular 
case is fixed but unknown, the average value over a long 
series of similar observations being zero, and the average 

value of being' — or • where nq is the 

^ nqa: nq^ ^ 

graduated number of deaths at age x. 

Taking an arbitrary radix for our mortality table, say l^y 

the values of log Ix^t foi' ages above x will be 

loge^ d?“lo^e^j7 

log^Z'or+l =logeZjtr + l + 


log'e^ .1-+^ — ^Ogelcc+t-h {Ux + Ux^i + . . + 

similarly, ^ve shall have 

^ logD'a—logD^ 

and for higher ages 

loge D ^^^ = log‘c + • • . 


whence, on the principle of approximation laid down above,. 






D..4 

■ D, 


Summing this for all values of t from 1 to infinity, we shall 
have 

N' 

Ct X jQ?" = [Na,+ + + -f-Dj;.. 

Here the quantity in the bracket in the numerator is the 
error in the value of as deduced from the observations in 
relation to the value of D';^ corresponding to the arbitraiy 
radix assumed at that age. The average value of each term 
in the bracket is zero, and the square root of the sum of the 
average values of the squares of these terms divided by 
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will give the standard deviation in the value 6f a' ^ as deduced 
from the data, which, omitting the suffix becomes 

]g^\/ &c. 

If the mortality table be graduated the standard deviation of 
the graduated values of a ^ will be somewhat less than that 
•of the ungraduated values, but not materially less, except at 
the ends of the table, the principal effect of the graduation 
being merely to produce a smooth progression in values. 

We might assume, for example, that the effect of graduation 
was about equivalent to substituting the average error of five 
successive values of for the error of the middle value. 
This would give (omitting a quite insignificant term) the 
expression 

5I)~ + W-ar+i (4Na;+ 1 + Na7+2) 

+ + &c.] 

for the error in the graduated value of a'a? in lieu of the 
expression given above. 

If we shorten the expression for the standard deviation 
of a' a: from 

to its approximate equivalent 

1 / 

^ \/5'Z^2^.N2^-f5W7^.N7^ + 5Wi2‘^.Ni2^ + , &c., 

.and, further, take 

5^2^ 25[cologe(fe)]" 

Observed deaths between x and (05 + 5) 
we shall considerably shorten the labour of calculation, and 
at the same time, by slightly underestimating the required 
value, make a rough allowance for the effect of graduation. 

We are now in a position to compute a table of standard 
deviations for for quinquennial intervals of age, the 
principal steps of the working being set out in the table 
following. The final columns showing the mean errors or 
standard deviations in the value of and the corresponding 
mean errors in found by dividing the former results by the 
•quantity + 

* If ax have slx^ error cr^r, then Par will have the error 

-_JL_ ^ ^ ^ ^ _ (fx 

+ / \1 + OTa; / l + l + aa + flTa; (1 + (1 + 

= (Tx-t- (1 + flx)' nearly. 
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Table XIX. 


Computation of the standard deviations deduced values 

of ax o.ud cTx deduced values ofVx- 


! 

25[COlOgcpj:+.2]“ 

xlOO 

Deatlis 

between 

SH-jr+oXlO-i 

= 102(2)-(3) 

xlO-^ 

Sum of 

10- Vcol. (t>) 

crx 

I ^se I 

Ages 

X and 

• last column 

— CTx 

(i -i- 0.x)“ 

i : 

! i 


x+5 






Cl) 

(2) 

(3) 

(4) 

(5) 

<G) 

(J) 

(8) 


•1020 

10 

1*020 

20244* 

21570- 

*2200 

•00038 

i SO 

*1113 


•0912 

1163* 

1326- 

•0653 

•00012 

' So 

•1266 

924 

•01370 

110-0 

162-5 1 

•0274 

•00006 

30 ' 

•1525 

3,072 

•004966 

24*48 

52*52 1 

-0187 

*00004 

35 

i *1981 

5,689 

•003482 

10-20 

28*04 1 

•0165 

•00004 

40 

' *2813 

8,152 

*003451 

5*758 

17*84 

*0159 

•00005 

45 

i *4410 

10,257 

•004295 

3*864 

12*08 

•0160 

•00006 

50 

; *7632 

12,620 

•006048 

2-726 

8*215 

0164 

•00007 

55 

1*444 

14,903 

•009694 

1*986 

5*489 

•0169 

•00010 

' 60 

2*945 

16,618 

•0L772 

1*445 

3*503 

•0177 

•00014 

65 

6*359 

17,455 

*03644 

*9770 

2*059 

•0187 

•00021 

70 

14*32 

16,042 

•08929 

*6052 

1*082 

*0203 

*00033 

75 

33*20 

12,172 

•2728 

*3185 

•4764 

*0228 

•00059 

80 

78*51 

7,317 

1*073 

•1227 

•1580 

•0272 

•00116 

85 

! 188*1 

2,863 

6*566 

*03151 

•03528 

*0364 

•00267 

90 

' 454*6 

692 

65*71 

*003659 

•003776 

'0550 

•00705 

95 

• 1105* 

1 

86 

1285* 

*000118 

*000118 

i *0966 

1 

•02146 


The Ti^sult we Jiave arrived at shows that the meaai error, 
or standard deviation, in the values of the 3 per-cent 
Annuities in an aggregate experience such as the is 

about one-fiftieth of a yearns purchase from about 30 to 65 
years of age. Owing to the greater number of deaths at the 
younger ages in the 0^ experience this would about represent 
standard deviations for that Table from 25 to 65. 

If we suppose an experience in which the data were 
one-hundredth of the extent of the but similarly 

distributed, it is obvious, from a consideration of the process 
by which the above result was obtained, that the standard 
deviations or mean errors in the annuity -values would be 
ten times greater than the values found above. Hence, with 
an experience including about 1,000 deaths distributed 
approximately as in the data the deduced annuity- values 
between ages 30 and 60 would on the average be uncertain 
to about i'20, or from 1 per-cent to 1-| per-cent of their 
values. The standard deviations above obtained would be 
somewhat reduced in a small experience by graduating the 
experience by Makeham or by a suitable frequency curve, 
but not very materially. It would occupy too much time 
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to investigate this point, btit we may easily find a limit to the 
effect of any possible method of graduation in reducing 
the standard deviations of the annuities., In any ordinary 
experience such as the 0^, where the observed deaths are a 
small fraction of the lives passing under observation, the 
errors in the annuity-values will be due, 1^, to the mortality 
on the whole being above or below normal, 2®, to the 
distribution of the mortality being abnormal. This latter' 
factor can alone be affected by any method of graduation. 
Assume it to disappear altogether, and consider the standard 
deviation for say aao 8 per-cent) obtained on this 

hypothesis. There were approximately 100,000 deaths 
observed above age 50 in this experience. We have 
V'l00,000 = 316 nearly, and if we assume the mortality 

above 50 to be throughout subject to an error of + of 

the observed amount, this will be equivalent to changes 
A B . 

of 4- and 4* hi the values of the constants A and B 
. — 316 ~316 

respectively, which, taking the value of A =*00589 and 
log c= *039, are equivalent in their effect upon the^ annuity- 
value to a change of *00186 in the rate of interest per-cent 
and of *0341 years in the age. The combined effect- of these 
changes upon the annuity-value at age 50 is equivalent to 
.4: *0148 as compared with the standard deviation of *0185. 
obtained above. The very considerable standard deviations 
at the ends of the table ivould, however, be reduced in much 
greater proportion. 

The problem dealt with above is not the same as that of 
determining the standard deviation in the estimated value of 
an annuity on a single life. This problem, which is also of 
importance, has been dealt with by Dr. Bremiker in his paper 
On the -Eisk Attaching to the grant of Life Assurances 
{J.LA. xvi, pp. 216, 285). As this paper is not very available 
for students and the notation is not modern, it may be worth 
while to give the following short demonstration. Bor the 
sake of simplicity continuous functions are used. 

If the annuitant, aged a? at entry, die at the end of the 
time t the loss to the company granting the annuity, or the 
deviation from its mean value, referred to the date of entry 
will be r 
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and tlie,siim of tlie squares of all values of this quantity^ 
multiplied by the frequency in each case^ will be 




dt 


\t'Px)dt- 


pco 

Jo 


(A^)^-2e-«A,^ + e-2« d 


8 ^ 


dt 


{fPx)dt. 


Noting that 

poo 7 

and — I {tPx)dt = A'x (at rate of interest = — 1) 

we obtain from the above^ as the value of the standard 
deviation of and therefore with sufficient accuracy for 

practical purposes of (=rZ^— - nearly) the expression 
cr= |[A'..-(A.,)^]-i' 


the first term in the bracket being computed at the rate of 
interest — and the second at the rate -1. It is obvious 
that the standard deviation for will be the above expression 
multiplied by S ; and for Ax less the capitalised value of the 
annual premiums (Px) (which Dr. Bremiker terms the Kisk 
attaching to the grant of Life Assurances by annual 
premiums) the risk will be the above expression multiplied by 
(Px + S). The premium is here supposed to be payable 
continuously ; if an ordinary annual premium is in question^ 
we should multiply the above expression for cr by (Px + d ) . 
The arithmetical values of these risks attaching to grant 
of assurances or annuities computed at 4 per-cent^ according' 
to Heym^s mortality table (General Widows Fund of Berlin) 
are given in the paiDer referred to, and show, as is obviously 
the case from general considerations, that the ^^risk^*’, or 
average fluctuation whether profit or loss, attaching to the 
grant of assurances at annual premiums is considerably 
greater than that attaching to their grant at single premiums. 

In practice the important question for a life office, in this 
connection — and the same considerations apply to other 
classes of insurance — is the average amount of the annual (or 
quinquennial) fluctuation in profit due to the deviation of the 
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death strain from its average or normal amount. In a 
soundly managed office these fluctuations never approach the 
point at which stability is remotely threatened^ but they 
become of importance when they are sufficient to produce any 
serious variation in the rate of Bonus. 

The mean square deviation of will be found by putting 
8=0 in the expression for which in that case takes the 

indeterminate form ^ which must be evaluated^ according to 

the rules of the Differential Calculus^ by differentiating 
numerator and denominator. The resulting expression takes 
the same form, so that the process must be repeated, and the 
limiting value of the expression for cr- when 8=0 will be 
found to be 

^ '^=“2 ds - 

which may easily be reduced to the foiun 

r 

= [mean square duration— (mean duration)‘'^] 

This being the mean square deviation the standard deviation 
will be 

cr= [mean square duration— (mean duration) ^ 

the mean deviation irrespective of sign is approximately 
*/98cr and the probable deviation *674cr, or very nearly 

go- and go- respectively.* [Gf, Be Morgan, Encycl. Metro- 
politan^ Vol. II, p. 460, Art. 149]. If instead of a single risk 
the average of ?i^risks be taken, all the above quantities will 
be divided by a/ 7 i. 

* The exact values for the mean deviation irrespective of si^rn of the 
^pectation of life and of the annuity will clearly he and t\d:c respectively. 
Where t is in the first instance equal to Cj. and in the second to the term of the 
continuous annuity certain dfi=:dx‘ 



107 


NOTE A. 


On the Evaluation of the Successive Moments of the 
Binomial Expansion of + 

These important moments may be found very simply in the 
following manner. The expanded series being 


= U() + + Ih + . . . 4 * Uji - 1 + 'l('n 


= ' 2 uxy where the subscript is identical with the exponent of q, 

the successive moments round the origin will be '^Ux, ' 2 xicx, SA-c, 
■&C. We yill first ^find the value of 'liicx, ' 2 :cii>x, '^x{x-~l)uxi &c. 
We have 


v.it^=(2, + g)"=l»=l 
Sa'ita: = 0 xp"'+l xnp‘''~^q +2 X 


n-l„ , o„ 


p'" ^ . +nq’' 






. • 




Similarly 

^x{x - 1 X 2 X + 2 X 3 X 


l)(y?.- 2) H - 3 3 

13 ^ ^ 


+ . . . + Thill - 


= n{ii - ^ + in- 2)p'' + . . . + {Z'*" 

= nin - 4 - = Th{n - 


and similarly we shall find • 

'^xix -l)ix- 2 )ux == nin - l)U ~ a.nd so on. 


2?te = l 


S'-iCa: = Srifa; = nq 

^3 w(ii-l) 2 

^hi-x = 1 — ux = (? 

^-,4 ^a;(a:-l)(»-2) ft(w-l)(w-2) 3 

^Ux = 2. iix = q, 

6 6 

VO... - 1)(^’- 2)(s: - 3)^. n[% - l)(?i - 2)U - 3) ^.4 

^ ^ ^ 


by the formulae on page 59. Hence we have {see the demonstration 
in Note E, page 124), using for the %th moment round the origin 

■??Z0 == = 1 

mi = — nq 

??io = 2'^\ix + 'S\ix = - 1 )^“ + oiq 

r 

niz = + ^'2hix 4* '^\ix = ^{n -l){n-2)q^ ' . . A 

H- dn{n - l)g“ + oiq 

7?2-4 = 242%-a; + 36Shfa; + li'Eht'x + '2%x = n{n> - l)(n ~ 2){n - 3)g^ 

+ Qn{n -l){n- 2)q^ + 7 n{n - l)g“ + nq^ 


These last equations may be found directly, by means of successive 
differentiation, according to a method suggested by Bertrand 
{Calcul des Frobabilites, Chap. IV, Art. 62). We have 

{p + g)» = p»-2g2 


dq 


0 X + 1 X niF ^ 4- 2 X '^q 


4- . . . 4-(7i ~ l)?ipg’^"“^4-7?.g''^^”^J 
and g. ~(2?4-gy^ = j^l x?ip’^"^g4-2 X ... 


= 1st moment. 
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Similarly, if we diiferentiate the last series with respect to g, 
and multiply the result by q (to restore the power of q which 
is lost in the differentiation) we shall have 

l-^xnp ^q-\-2-^x ... + ?r^ 

“ 2nd moment ; and so on, so that 

[^th moment] = q~[{t- l)th moment.] 
dq 

Thus, the first moment 

^qj-ip + qT 

dq 

= w9'(i3 + g)®‘^ 

Second moment 

= g — {nq{p + g)'"''T = n{n - Vjrip + g)'“"“ + nq{p + g)“' ^ 
dq 

Third moment 


= g T - 1 ) 2 "( P + s)’“ " ■ + nip + 7 )“ ' 

dq 

= «(» ^l)(w - • 2 )?'^( 2 J + g)“'® + 2 »i(« - l)g-(p + g)”'^ 

+ »(«,- 1 )g“(j3 + g)’‘ ~” + nqip + g)“ ' ^ 

= m(w - 1)U - ^)q^ip + g)”"^ + 3a(» - l)g'(i5 + g)""' + ?ig(i3 + g)’*“^ 
Fourth moment 

= g -T- [third moment] 
dq 

— nin — l)(w' ~ 2 )(w — i)q^ip + g)” ^ + Snin — 1)(m — 2)g (p + g) 

+ 3nin - l)(ji - 2)qHp + g)“"® + 6«(«i - l)g"(j3 +g)"'^ 

+ n{n - l)q-ip + g)”"' + nqijp + g)“'^ 

= w(w — l)(w — 2)(w — 3)q^ip + g)” ^ + Qiiin — 1)(m — 2)g ip + g) 

+ - i)g'(i> + g)”“ ^ + «g(p + g)”'^ 

Putting unity" for all the powers of (p + q), these expressions 
are the same as previously found — see equations A. 

* This may not be done at any earlier stage because tbe.difierentiations are 
witli respect to iahing p constaut, wtereas to substitute jp + 2 ^ betore 

fiaishing the -differentiations would make p vary with q* 
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We have thus obtained the moments round the origin. Thence 
the moments round the mean may be found by the formulae on 
p. 41. Thus 

/zi = 0 

/X 2 = mo - (mi)^ = n{oi - + nq - = nq - ncf 

= %g(l-g)=?ii3g 

jtxg = m3 - Snii . ju-o - 

— n{% -l){n- i)(f 4 - ^n{n - l)g“ + nq 
- - 7i?(f 


= Tig - 3Tig- + 2n(f = Tig{l - 3g + 2g-) 

= Tig(l - g)(l - 2g) = Qipqip - q) 

/14 = - 4 Tni . //3 - Zmi . /xo - nii 

=Tig[(Ti^-- 67 i^+ IIti- 6 )g^ + 6 ( 71 ^’ - 3ti + 2)g^ + 7 (71 - l)g+ 1 
- 4wg(l - 3g + 2g“) - 6?iV^(l - ??) - 

f 

— nq\Z{n ~ 2)g^ - 6(71 - 2)g“ + (37i ~ 7)g + l] 

r r 

which reduces to 

nq{l - ?)[3(w - 2)(1 - q)q + l] 

= npg[3(»i - 2)pg+ 1] 

It is evident that all the even moments must involve p and q 
symmetrically ; while the odd moments will involve a symmetrical 
function of p and g, together with the factor {p - g), because they 
must vanish when ^ = g (i.e., when the curve is symmetrical) and 
must only change sign when p and g are transposed. 


It may be convenient to repeat here the Author's demonstration 
given, JJ,A,, xxvii, 214, of the value of the average deviation from 
the mean irrespectwe of sigQi, that is, treating all the deviations as 
positive. 

If we suppose the event to happen on times in the oi trials the 
deviation from the mean number np will be {m-np) which, since, 
jp + g is always equal to 1, may be put in the form [Trig - ( 7 i~m)p]. 
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This will be positive or negative as m is > ox <np ; and the 
probability of this particular de^uation will be 

91 . {m + 1 ) - m 

k. — m 

The greatest positive deviation will be nq (when the event 
happens at all the 9i trials) ; the greatest negative deviation — np 
(when it fails at every trial). 

Hence, we have the following scheme, in which m is to be taken 
as the next integer < np. 


Possible Delations from Mean llesidt np. 


to 

Magnitude 

Probability 

Magnitude X Probability | 



pu 

np'^q ^ 


, {n~l)q-p 


n{n — \ -^gp j 

(» — 2)2'--2p 

i ~-<7- 

— l)(w — 2) „ ^ ' 

W 



' 

! CTi 

1 ^ 

! 

# ... 


» If 

n . . . + 


(m + 1)^ — (n --m — l)p 

n . (m 2) . - 

' ' qjm+lQn-m-l 




i 

i 

iWi" 


n ... (m-i- 1) 

* ' —771 

i 

n ... m i 


1 

n—m ^ ^ 

\n-'m ^ 

o 



_» (m + l) It 

\n — m — l ^ ^ 

c: 

tc 

o 






np(p^-^ 

npq'*^ — n . (n — 1 

iP 

'Yik. -5 —np 

qn 



If the final column of products is examined it will be seen that 
each positive term is cancelled by a similar negative term in 
the succeeding product. Hence, the total of the products, that 
is to say, the average deviation, is zero, showing that np is the true 
mean result, the positive and negative deviations from which 
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exactly balance each other. Of the terms above thc' horizontal 
line, representing the positive deviations, the sum is, of course, 

equal to the only uncancelled term, — - "t — ~ 

.and similarly of the terms below the line representing the 
negative deviations the sum is - ^ ^ ^ 

Hence, the average magnitude of the deviations, that is, the total of 
every possible deviation multiplied by its probability, regardless of 
sign, is 

\Vlz 


( m + 1 ) 1 , 2 , _ 4-1 72 , - -ill 

•m-1 ^ ^ 


which, since the sum of all the probabilities is necessarily 1, will 
also be the average or mean deviation. This result is exact, not 
approximate, but where n and on are large numbers it is necessary to 
;simplify it by the use of Stirling’s formula, which gives for large 
numbers 1^1= J nearly. 

: Put {a) into the equivalent form 

2 1 71 (tI — on) m+l^n - m . 

T“n ^ U y 

1 7 a 1 71 -771 

using Stirling’s approximation to the factorials, we have 


Since on is the integer immediately below Tip, we may write 
m ~oip-h I 71 ~ on = nq-\-h (where h is a fraction) ; hence, we get 









but where oip and oiq are large numbers, h being a proper fraction, 
.. the last factor is very nearly equal to 1, and { 1 -] and 

\ Tip/ 

/ k\~ 

\ + equal to and respectively ; hence, 

the above expression reduces to 

^^7ip(7 = *79788 \/ npq = ^ V Tipg^ very nearly. 
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Although this result has been obtained on the assumption that 
and nq are large, it ^^dll be found to be very approximate 
even for small numbers. As an extreme case, suppose 120 lives at 
risk, the probability of death in each case being *02 ; the 
'‘expected” deaths would then be 2*4, and the extent by which 
the actual deaths would, on the average, exceed or fall short of this 
number would be given by the formula as 

4 / 

g V2-4x •98-1-227. 

The true value of the average deviation given by formula {a) is 
2 , (•02)^(*9S)^^^ 

= 1-243, 

almost identical with the approximate result above, 


I 
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NOTE B. 


On the Use of Logarithms of the Unadjusted Terms 
OF A Series. 

Consider the number of eases out of a given series falling into 
a particular group ; or the number of deaths, or analogous events, 
at a given age or group of ages, accruing out of a given number at 
risk. Suppose the series to consist of n cases in all, and let the 
true probability of any case falling into the particular group be p, 
and let m = np. Let the observed number of cases in the group be 
Qn == m + z, where, as we have seen, z has an average value of zero, 
z^ has an average value of 



01 

z^ has an average value of 

«y(l -y)(l - 2p) = &c ■ 

If we operate with the logs of the oliserved quantities on, we 
must avoid by arbitrary grouping cases in which on is zero, or on! \oi 
very small when the logs become infinite or yery great ; but when 
this is done we shall still find the logs of the ungraduated numbers 
less on the average than the values of the graduated (or true) 
numbers. This may be easily seen from a simple example. Let 
71 = 4, and p~\, in which case on — 7ip = 2. The observed values of 
on may be anything from 0 to 4, and we shall have the following 
possible cases : 


Values of 

Relative 
frequency 
of these values 

log m' 

Products 

(2)x(3) 

( 1 ) 

(2) 

(3) 

(4) 

0 

tV") 

-CO ) 



[a 

[ --097 

(say) — *030 

1 

3 

•000 ) 


2 

TS 

*301 

*113 

3 


•477 

•119 

4 

ih 

•602 

•038 

r 

Total 

1 

... 

•240 
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Here, to avoid the cases in wliicli the observed value of m is 
zero, vre have combined the first two groups, taking four cases in 
v’hich «/ = 1, for one case in which m =0, thus giving an average 

value of m — the logarithm of which is -*097. Notwith- 
5 

standing this dence our average value of log m is only *240 as 
compared with the value of log ??? = *301 (where ?/i = 2 is the true 
value or average value of ni ). 

Assume that on the average 

log [??■/ (1 + Z*)] = log m 

= log[(m. + ^)(l +Z;)] 


Whence 


= log m + 


&e. h T 

Til 2rir 


+ T 3 , Ac., + Z; — :- + Ac. 
2m-‘ dm 2 


Insert the average' values as given above for Ac., 

.7 Ir , n n-m {n-m)(n-2m) , n 

Z.* - — + Ac. = - — --w- s + Ac., 

fk 2 2nm on~nr 

or, omitting terms of the second order, 


which, again omitting terms of the second order, may be vTitten 


log (1 + Zj)] = log m + J = log 

where p is the observed value of the probability p. 

If this expression be substituted for log m in the example 
given above, we should have as the sum of the products of 
col. (2) X col. (3) the value *309, which is very much nearer the 
true value -301 than the uncorrected value in the above table. If 
we take larger numbers, as 7i= 100, m = np^ 10, we shall find by a 

similar process the average value of log io(m' + ) is *99987 as 


compared with the true value of log m = 1*00000. "Where the 
numbers n and m are very large, the correction, of course, becomes 
insignificant. 


p 
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It may be shown in like manner that, if we are dealing with the 
reciprocals of the observed values, then, on the average. 


— ; — \ ; = - nearly, 

m + 1 -p m 


and again, on the average, 

^ 4 * - ^ = J m 

Eeverting to the question of the use of the logs of the 
ungraduated quantities, it will be found that if the above results 
are made use of in practice, the logarithms will be over-corrected. 
The reason for this is that we do not eventually arrive at the true 
values of log Qn and log p, the graduated values being still affected 
by an outstanding or unbalanced error. If our series consists of a 
large number of groups, these outstanding errors will be com- 
paratively small, and the above correction will not be much in excess ; 
but if the number of groups is very small, our graduated quantities 
must necessarily follow rather closely the original values, and the 
use of the above formula would largely over-correct the series. 
Suppose, for example, we had a series of ten groups. We should 
require about five groups to obtain the general form of the curve, 
or to determine the constants of any frequency curve, employed, 
hence the errors of the groups would only be reduced by the ratio 

of approximately and the correction Jc as shown above should 
be reduced by half, and proportionately in other cases. 
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NOTE C. 


On the Eationaxe of the Method of Least Squares. 

In statistical work it often happens that a number of constants, 
entering into the known mathematical form of a given function, 
hav'e to be evaluated from a much greater number of observed 
values of the function. W e may, for example, have three constants, 
such as cr, y, z, in the expression lx + my + nz = F, and fifty observed 
values of F (embodying difi'erent values of the coefficients 
I, m, n) from which to determine the constants. If the observed 
values of F were rigidly accurate, any three of them, or any three 
combinatioiis, ^vould suffice to determine the constants, and it 
would be immaterial what set of three was selected, since all would 
lead to the* same resiilts. But generally the observed values of F 
will be affected by errors of observation and hence will not be 
strictly consistent; and taking the above example each of the 

^ ^ 190 Q different sets of three indmdual equations 

6 

would in general produce different values of the constants : so that, 
apart from the prohibitive amount of labour required in the solution 
of so many equations, we should have no means of deciding which 
was the best or most advantageous solution, or how to combine the 
solutions in order to obtain the best average results. The method 
of least squares supplies the means of combining the original observa- 
tions in such a manner as to produce a number of equations, equal 
to the number of unknowns (in the above example, three), the 
solution of which by the usual process leads to the most probable 
values of the unknown constants. 

Suppose that the observed function F is a linear function of the 
variables x, y, z, of the form lx + 7)iy + nz . , . , and that the errors 
in the observed values of F follow the ‘‘normal law”, so that the 
probability of an error h is proportional to where the standard 

deviation of F is We shall further supi^ose that the equations 
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have been so weighted that the value of c is the same in each of the 
observations, or that the ‘‘ precision ” is uniform. Thus, for 
example, if in a given equation the probability of an error of h in the 
observed value of F is proportionate to with a standard 

deviation of then multiplying the equation by we shall 

Av/2 


have an equation with a standard deviation of — j= as before, and 

the probability of an error Ic will be proportional to as 

required. 

Let there be t equations as follows (where t is supposed greater 
than the number of unknowns, say s ) : 


liX + rtiiv + niZ-\- ... - i 4 qF == ki 

Ux + m2y + n^z + . . . - Wo'F == ko 

IfX + lUty + + . . . — = ki^ 


(A) 


where F represents the true value of the observed function and 
/ji, /^o, . . . the errors of observation. The chance of the errors being, 
by hypothesis, respectively proportional to . . the 

chance of the conjunction of these individual errors wilLbe propor- 
tional to which %vill obviously have its greatest 

value when the quantity in brackets is a minimum. the most 

probable values of the constants will be those that give the greatest 
probability of the observed event, i.e., the happening of the given 
combination of errors. Thus, the most probable values will be those 
making [ki/G^ + k 2 ^lc^+ . . .] or Uctli? or '2kt a minimum — hence 
the name ‘‘ method of least squares.” 

Now we have 

+ . . . -%F)-] 


and since x, z . , . are supposed to be independent, the minimum 
value must correspond to such values of a:, 2 /, ... as will make the 

partial differential co-efficients of this expression, with respect to 
XyV^z,..^ all vanish.* Hence we must have, omitting a 
common factor 2, 

+ . . . -Wt‘F)] = o" 

2[mt{ltx + mty + ntz+ . . . -'iyiF),] = 0 ^ 
y[nt{ltX + mty + ntZ+,..-'Wt'F)]^0 
&c., &c., &c. / 

* These conditions, though necessary^ are not in general sufficient to ensure a 
minimum, hut in this case it is obvious that a minimum exists because high 
negati'ce values and \n^ positive values of ar, y, iP . . . alike give large values to 
the function. 
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(the suniniatioii extending to all Tallies of t) as the system of 
equations, s in number, for determining the most probable values of 
X, y, z. Hence the rule : 

First prepare the equation by multiplying each by its proper 
weight (the reciprocal of the probable error or standard deviation), 
thus giving a set of equations with a uniform p.e. and s.d. Multiply 
each equation by the coefficient of x and add all the results together ; 
next multiply each by the coefficient of y and add all the results 
together, and so on : the resulting aggregate equations, solved in 
the usual manner, give the most probable values of the constants.” 

It ^vill be seen at once that if there is only one constant to be 
determined, the method based on the normal law of error gives 
the weighted average, i.e, the total of the weighted values divided 
by the total weights, as the most probable. Conversely, it may be 
shown that if the weighted average is the most probable value, then 
the facility of error must follow the normal law. Apart, however, 
from any hypothesis as to the law of error, it may be shown 
mathematically that the method of least squares gives results which 
become more and more nearly accurate as the number of observations 
increases. Considerations of a more general kind will also lead to 
the conclusion that the method must produce very good results. 
Without gjvdng any definite form to the law of erroi*, it is obvious 
that large errors are less probable than small, and that the most 
advantageous systen? of values for the unknown constants will be 
that which produces, on the whole, the smallest numerical deviations 
(irrespective of sign) between the adjusted and observed values of 
the function. Now, if the law of error is supposed unknown, we 
cannot investigate mathematically the conditions required to produce 
a minimum deviation irrespective of sign ; and the simplest function 
of the errors which is independent of sign is the square of the errors, 
which will be the same for a positive or negative deviation, 
and at the same time attributes a rapidly increasing importance, or 
disadvantage, to the errors as they increase in magnitude. Hence 
we can see, in a very general way, that a method which gives 
a minimum value to the sum of the squares of the eri-ors, is likely 
to lead to satisfactory results consistent with elementary notions as 
to the nature. of the errors. Moreover, in actuarial work we usually 
have to do with numbers sufficiently large to make the normal law 
of error very near the truth. 

Reverting to the system of equations (B), it will easily be seen 
that if F is a parabolic function of the form a; + ay -p + . . . the 
equations for determining x, y^ z, , . . &c., are., equivalent to 
reproducing 2F, 2oF, SdT (SwF, 'IwE.a, &c., if the equations are 
weighted), i<?., the successive- moments of the observations. 
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It has, so far, been supposed that the function F is a linear 
function of the constants If this is not the case, 

suppose that the equally weighted equations, from which the values 
of X, y, z, . . . are to be found, are of the form 

fiix, ) -OTiF = ^'i' 

f^{x, y, z . . .)- = 1-2 (C) 

&c., &c., &e. j 

where /i, A • • • are known functions of the variables z , , , 
Ey means of t of these equations, or of t combinations from 
amongst them, or otherwise, find approximate values of o:, y, z , . . 
say £5\ • • • ; and suppose that + 8Xy y — y^-^Sy, z = z^ Sz, 
&c., where it may be supposed that Bx, Sy, Bz , , . , representing 
small corrections to be found, are so small that their squares 
may be neglected. Then if 

/i=/i(*‘, y\ • • •) > ^1 = 

and so on, equation (C) will become 



These equations are linear functions of the small corrections 
Bx, By, Bz , . . which can accordingly be found by the rules already 
derived; and hence are found the corrected values »; = £c^-h8a*, 
^ &c. The process can be repeated, if greater accuracy is 

desired, until the corrective terms become insignificant. 

Ill the important particular case of a graduation by Makeham’s 
formula, the original equations are of the form 

2 2 " I 2 2 

2 
r 

{w being the “ weight '0- Approximate values of the constants, say 
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A', B', c , being found, the resulting equations for determining 
SA, SB, 8c are as follow, representing A' + BV/^ : 

(2w)SA+ (2w.6''’'^5.)3B + (^2wBV*"Aa:+ 

= (2w) (/X 1 ”■ M' 35+ l) 

(2w.c''''*'^)aA+ (2W^®+^)8B+ (^2w.BV^*.a; + ^-^Sc 

, a;+l 

= (2w.CJ ^)(Ma;+-i “■M’a;+l) 

2 2 

(2?«.B'a;+ 5 c'*“ 2 ) 8 A + (2w.B'a;+ 5 c'-*) 8 B 
+ {2w.B'“(3;+ 

= 2«o. B'a; + ^ «' ^ i " /^ *+ i) 

For an example, see J.I.A., xvii, 161-71. 


XOTE D. 


On the Use or the Binomial Curve to Eepresent 
A Continuous Series. 


If the Binomial curve 2 /= , — p" — ^ made high contact with 

\x \n — x 

the axis of x at the points x= - I and = + 1, where p becomes 
zero, it could be conveniently employed to represent a continuous 
curve in lieu of representing merely isolated ordinates ; as in 
that case the moments of the continuous curve would very 
closely agree with those of the isolated ordinates. The same 
would be true of any series of equidistant points on the 
curve supposing these to be fairly numerous. If, for example, 
we suppose the values of y tabulated for every intrgral value 
of then the i^th moment of the curve would be increased 
by multiplication by the factor and •"•from tho observed 
numerical values of the first 4 moments h and the remaining 
constants could be obtained. As, however, the curve y cuts the 
axis of X at an angle at both limits, this method of proceeding 
will lead to approximate results only when n is fairly large. 

The area of y treated as a continuous curve may be approxi- 
mately determined from the well known approximate formula 


/ 


'rt+i 

y.dx^ 

-1 


1 11 

^y-l+yo + • • • -Pn-i-l + ~ 


\dxJn+iJ 


y^ 1 and Pn+i being of course equal to zero and the series + • • • 
is the expansion of (p-^qY^ where we assume p + 5'==l, and is 


1 1 

therefore also = 1. As the factor rr vanishes for £c = - 1 ; and [ 


n-x 


vanishes for a; = w-i- 1, we have 

KdxJ^i -x ^ ^ dx\x)^i 71 + 1 ^ ^ 




\dxJn+l NaJ 




.d 1 


dx 


i-) =--A_ 

— X J yi+i 7l-h 1 





since v,- as is known = 1 when x= — 1. Hence the area of the 
dx\x 

curve y becomes 



I ^ 1 

12 u+1 V pq 


Analogous expressions can be found for tbe approximate value of 
the moments 

Jmjdx . Jx-yclx ^ 
fydx ' fyclx ^ 


but the relations which result do not lead to sufficiently convenient 
formulae for practical use. 



NOTE E. 


On the relations between the Successive Moments and 
THE Successive Summations of a series. 

The relations given on p. 60 may be systematically demonstrated, 
and developed to any extent that may be required, by means of the 
ordinary interpolation formulae combined with a table of the 
power-differences usually known as the Differences of Nothing 
— see Text-Book, Part II, Ch. xxii, Art. 11 ; Sunderland’s “Notes 
on Finite Differences ”, pp. 24-5. 

We have, by the ordinary interpolation formula, 

. x.x - I . 

= + xAVq -}- - A ^0 + • • • 

andhence = + . . • 

2 ^ 

so that = + + ••• 

= {'2icq)vo + {2\) . Avo + ( 2 V >) • + . . . 


using the notation of p. 60. 


Put % = and we have 


Putting m equal successively to 1, 2, 3 ... , taking the differences 
from the table of the differences of nothing, and noting that the 
first term vanishes whatever the value of ??i, we can write down 
at once — 

'^X^'Ux — "t 22^'U'2 

= 2n/a + Q^u.2 + 62^% }.. A 


2iKX = 2 'Mi + 142*% + 362^% + 342®% 

2a;®% = 2 *Mi + 302*% + 1502*1^3 + 2402*^4 + 1202*%/ 


These equations, divided by '2ioq give the expressions for the 
moments set out on page 60. 



125 


Taking next the usual central diiEference formula, 

r® = fo + .tuo + ^ So + Co + (fo 


, .ir-lKr^-k) 

+ ^ <’0 




?^5n+ (a^-l)gfe+l) ^^ ^ ar(a:+l)(;r-l) 
3 4 


d. 


(x - 2)(.r - T),i-(.r + l)(r + 2) 


= + M, + ~ • I ^ I 

{(x + 2)(x + l)a;('B - l)^ + {(g + l)x(x - l)(a: - 2)} dp 
2 !4 


Thus, 


—(witii) = (2ife;)ro + (Sri{a:)ao + {2r(a; — l.)ux + 2(.r + ].)x.ux} ^ 


, +{2(x+lM;r-lW}|+... 

= (SMo)i-o + :^i.ao+ J(2V + 2V)So +S‘u2.Co 

+ k25«3 + 25«2)^fo+-. - 

the law of the terms being manifest ; or, abbreviating the expression 
|(2*m* + 2‘«x+i) 

by the single symbol 'Zhix+h, the series may be written 

'2{uzVx) = {'2uq)vq + (2-Wi)ao + (2?%^) + &U'2) Cq+. . . 


Putting Vz — x‘^\ forming the central differences of as shown 
in the scheme below, we write down at once 

'>o "^2 

IxUz — 2 Ml 

Ss®!fa = 2SV 

■ B 

'2xhix = GS^jfo + 2^1 

9 

^iiAlx = 242)’^/.2i + 


9 



X—X^ 


X 

Xx A a" 

X 

Vx A a" A^ 

X 

Vx A A^ A-^ 

-2 

4 

-3 

-27 

~3 

81 


-3 


19 


-65 

-1 

1 2 

— 2 

- 8 -12 

-2 

16 50 




7 6 


-15 -36 

0 

0 (0) 2 


- 1 - 6 

-1 

1 14 24 


1 


1 6 


- 1 -32 

1 

1 2 

0 

0 (1) 0 (6) 

0 

0 (0) 2 (0) 24 


3 


1 6 


1 +12 

2 

4 

1 

1 6 

1 

1 14 24 




• 7 6 


15 +36 



2 

8 12 

2 

16 50 




19 


65 



3 

27 

3 

81 


The simplification in the formulae is, of course, due to the fact 
that when m is even the odd central differences vanish, and when 
m is odd the even central differences vanish. 


It is sometimes required to find moments of the form 

For this purpose we may use the formula {see Sunderland’s Notes 
on Finite Differences,” p. 32) — 

■ ^ 

= ^ (^0 + ^i) + - (cy + a’ - l) At'o + ~ A‘'(?;o,+ v - 1) 


1 . l) + ? r(a!-~ l)fa- 2) *3^ 


13 




. {x+l)x(x-l)(x--2)l 
+ ^ 4-^-2) 

whence we find, in the same manner as before, that commencing 
with ViWi, we shall have 

2^;^; . Wx = 2wi . . A^?o + . ~ {^\ + A^'w - 1 ) 

1 


4•2%^2iA^'^^-l + 2^%i(A^t;-l + A^'y-2)+ • • - 




]27 


Putting 


Tz - 


/_3+,,r.(S21rl)- 

\ 2 / 2''* 


the following Table shows the values of (%x - I)'"' and its differences, 
whence Xx and its differences will be found by dividiii<:»* bv 2"*. 


1 X 

2x-l 

(2:r-l)‘ 

A 

(2:p-1)^ 

A 

A-' 

(2,r-l)3 

A 

A- A^, (2,^-1)-^ 

A 

A- 

A^ 


I —2 

— 5 

— 5 


25 


■ 

-125 


, 625 








2 


-16 



9S 

_ 

544 




i-1 

-3 

-3 


9 


8 

- 27 


-72 ' 81 


464 



i 



2 


- S 



26 

48: 

SO 


-384 


i 0 

-1 

-1 


1 


s' 

- 1 


-24 ; 1 


80 


384 : 



0 

2 

(1) 

0 

(8): 

(0) 

2 

(0) 48 : (1) 

0 

fSO) 

0 

(3S4) 

i 1 

-rll 

4" 1 


1 


8; 

4- 1 


+ 24 1 


SO 


3S4 




2 

! 


8 



26 

48 

so 


334 

I 

! 2 

T 3 

+ 3 

2 : 

9 

16 

8, 

+ 27 

98 

+ 72 i 81 

544 

464 


1 

i 3 

-f 5 

' S 


25 



4-125 


! 625 






Dividing by the appropriate power of 2 and inserting the values 
of + ^ (A'ro + 

the last formula becomes 


tx'i-X-X 




Wx^ commencing with ( ~ ) u\ 


, — (w]jeii ■??z = 1) 

= (when m = 2) 22Vo + - ISzci 
4 

= (when m = 3) 

4 

= (when m = 4)242''2r3 + 2^t■l 

46 


(C) 


Writing now = and so on, i,e.^ reckoning the ordinates from 
zero, so that t 
these become 


© m 

+ -j -h \ . , 


N.7?2i= E-% 

N . mo = 22^2^1^ + 7 
4 

N.??!3= 

4 

N . 7?24 = 242^22.7i + ^Uh , 

16 'J 


(D) 
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This will be made clearer by a numerical example. Take the 


following series. 


X 


Distance 
from origin 
multiplied 
by 2 
— d 

Vx>^d 


lAx X 

Ux X 

•5 

16-74 

1 

16-74 

16-74 

16-74 

16-74 

1-5 

15-69 

3 

47-07 

141-21 

423-63 

1270-89 

2-5 

14-70 

6 

73-60 

367-50 

1837-50 

9187-50 

3-5 

12-99 

7 

90-93 

636-51 

4455-57 

31188-99 


60-12 


228-24 

1161-96 

6733-44 

41664-12 




-f-2 = 

-i-4=- 

_^8 = 

■^16 = 




114-12 

290-49 

841-68 

2604-01 


The alternative method by summation will be as follows : 


1 ^ 



2“Ua: 




-5 

16-74 

60-12 

144-18 

(114-12) 


... 


1-5 

15-69 

43-38 

84-06 

137-73 

204-39 
(135-525) < 


2-5 

14-70 

27-69 

40-68 

53-67 

66-66 

79-65 

3-5 

12-99 

12-99 

12’09 

12-99 

r 12-99 * 

12-99 


2-% = 114-12 

22W+ t2w = 2 X 137-73+7 x 60-12 
4 ■ 4 

= 275-46 + 15-03 = 290-49 

62^m,+ 7 SV = 6 X 135-525 + 7 x 114-12 
^ 4 4 

= 813-15 + 28-53 = 841-68 

242=W4+52®i(i}+ 24 x 79-65 + 5 x 137-73 + x 60-12 

16 ' 16 

= 1911-60 + 688-65 + 3-76 = 2604-01 


With a heavy series of terms, the saving of labour by the 
summation method will, as may easily be seen, be very considerable. 
A further saving of labour may be obtained by calculating the 
moments round some convenient central point, and thus breaking 
up the series into two parts in the manner indicated in 
Mr. Elderton’s treatise, pp. 22-33 ; and any of the formulae described 
in these notes may be applied in this manner. 



NOTE F. 


On the Identity of the Method of Moments and Method 
OF Least Squares in the Case of an Exponential 
Function. 

Suppose y an exponential function of x so tLat 

Say. 

Then if y be taken to represent any group in a frequency distribu- 
tion where the number of groups is large the probable error in y 
will be approximately Jy. Assume the true values of y, i.e,, the 
true values of 5, c . . . , to be approximately known, and Jet the. 
observed vi^Jues of y^be denoted by y. If, then, we weight each 

equation y — the factor writing 

V y 

-7-(2/'-^)=0 (l) 

• vy 

we shall have a series of equations of condition in which the 
probable error is in each case identical ; that is to say, they will be 
suitably weighted for the application of the method of least 
:squares Note C., p. 117-8). 

Writing 2 /^ = (y + + Sza,) 

da do 

= y (l 4- Sa + a;. §5 + £c^. Sc, &e.) 

•equation (l) becomes 

- V [y(l 4- 4- &c.) - = 0 . . . . (2) 

vy 

and multiplying each equation successively by the ^coefficients of 

85, &c., i.e.^ by~y^, &c., and taking the sum of each 

^y ^y ■‘^y 
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set of products, according to the rules of the method of least 
squares, we get 

2[2/(1 + Sa + x.^ + 3r. Sc, &c.) = Q 

2[pj(l +8a + a\8Zi + a;“.8c, &c.) - a-.c*] = 0 

as the system of equations for determining, according to the 
method of least squares, the small corrections to be applied to the 
approximate values a, h, c, . , . used in obtaining the approximate 
values of y. 

Now, obviously, if y is so taken that 

2(2/-^") =0 
=0 

2:ftr(y-~/) = 0, &e. 

i.e,, if the values of the constants a, &, c are found by the method 
of moments, &c., the above equations are satisfied by 8a = 85 = Sc — 0 
that is to say, the corrections are zero, or the values found for 
a, 5, c ... by the method of moments are in conformity with the 
method of least squares on the assumption that-the observations are 

properly weighted by multiplying by the factors -y-, the weights 

Jy 

being assumed invariable. It may, however, be supposed that small 
variations in the constants, a, 5, c, . . . would produce slight 
variations in the weights, in which case other solutions may exist 
which would also lead, by the method of least squares, to equations 
satisfied by 8a = 85 = 8c = 0; but as it is well known that small 
differences in weights have practically no effect on the results, it is 
evident that any such alternative solution must be very close to 
that already formed. 



NOTE G. 


On obtaining the value of Makeham’s constant c 

DIRECT FROM THE EXPOSURES AND DEATHS. 


As stated in the text an exact value of this constant is not very- 
important, and this may he illustrated by reference to the data for 
ascending premium assurances given in Table X. An approximate 
value for c may readily be found by a process such as the follovdng, 
which is in principle analogous to the aggregate method employed 
by Mr. King in the Text-Book, Part II. Take the values of for 
the central age of each group in Table X. Eeject the initial and 
final values, as depending upon only two and three deaths respectively. 
Take the sk values for central ages 32^ to 57 weighted respectively 
by the factors 1, 3, 5, 5, 3, 1 ; weight the six values for central 
ages 47i t(?72-|-, and also for 62 J to 87| in the same manner. We 
shall then have the following totals : 


X 1 = ‘0119 

M47iXl = *0l37 

yU62i X 1 = *0340 

X 3 =■ *0345 

X 3 = *0534 

fj-mi X 3 == *1647 

yU42i ^ 5 = *0655 

5 = *1160 

At72i X 5 = ‘3660 

JU474 X 5 = *0685 

X 5 = *1700 

JU774 X 5 = *5720 

X 3 = *0534 

iU67i X 3 = *1647 

Msej X 3 = *7540 

^ 1 = *0232 

;47,^x1=.-0732 

M875 X 1 = *3379 

Si=-25'70 

So = *5910 

83=2-1286 


If the mortality follows Makeham's law, we shall have 


Sa-So ^ 1-5376 ^ 15 
Sj - Si -3340 

since 15 years is the interval between the centres of our empirical 
groups. This gives log c = *0442 nearly. If we take the sum of 

x 2 
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the unweighted values of /x in three groups for ages 321 to 47-2^ 52-J 
to GTi, and 724- to 8 Ti, we should obtain in similar manner. 

oo — So *6136 •• 1 A/jjQ 

We may conclude, therefore, that log c probably lies between 
*044 and *045. The values of y. for ages 274 and 924, which we 
have omitted in the foregoing, are respectively much below and 
much above the general curve. If these values had been 
included duly weighted, we should have obtained a slightly larger 
value of log c, nearer to *045. 

If we adopt *045 as an approximate value, we obtain for the 
values of the constants A and B, by the process described on p. 65, 

. A- *00950 B- *00003712 

W e will call this curve (a), the deviations from the adjusted values 
of ^ in Table X being shown in the Table below. AVe might 

Ascending Premium Asstirance Experience. 


• Demotions in Computed Deaths for Curves (a) and (/3). 


. 

Middle 

Age 

of 

Group 

Observed Deaths 
corrected as 
per Table (X) 

Deviations 

Computed Deaths — Observed Deaths 

Curve (a) 

We = *045 

Curve (/3) 
log c = *046 



+ 


+ 


m 

•8 

*9 


1*0 


324 

29*2 


3*5 

. . . 

2-8 

374 

102*0 


1*8 


*2 

424 

175-2 


7'3 


5*9 

474 

191*7 

12*8 


13-b 


: 524 

218*6 

3*3 


2*0 


574 

228*4 

7*3 


5*1 


624 

255*4 

«.• 

2-7 


5*2 

674 

274*4 


24*8 


26*5 

724 

205*6 

12*6 


12*6 


i:. .774 

. 151*6 

12*1 


13*6 


i ! 824 

84*8 


6*6 


5*1 

874 

22*3 


*5 

' *2 • 


924 

2*1 


1*1 


1*1 


Sum of deviations 

48*4 

48*3 

46*9 

46*8 


Second sum . . 

38-9 

37-5 

' 39*3 

39-9 


Third sum . . 

- . . . 

13*7 

71*8 

31-9 

36*9 
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expect from our first rougli approximation to log c that a smaller 
value than *045, say -044, would give better results. We find, 
however, that the third sum of the errors of the (a) curve is 
negative, and this indicates an increase in the value of log c. 

Since a higher value of c hollows out the curve at the middle 
ages, increasing the computed deaths at the extremes of the table, 
it is clear that the efiect must be to increase the third sum of the 
graduated deaths. 

The probalility is therefore, that curve (a) will not be much 
improved by changing the value of c. 

If we take the alternative value log c=*046 we find the 
de^fiations from the adjusted values of 0 in Table X are given for 
curve (3 on the previous page* 

There is little to choose between the two graduations, notwith- 
standing the smallness of the third sum of the deviations in curve 
{(3), for against this may be put the fact that the three largest 
errors in (a) are all increased in (13). On the whole the curves may 
1)6 taken as showing that an approximate value of log c is generally 
sufficient, and that nothing is gained by computing this constant to 
several places of decimals. 

It may at first sight appear inconsistent with the general theory 
to adopt Allies of the three constants which do not make the third 
sum vanish ; Le., the third moment of the graduated and ungraduated 
figures identical. ?t must, however, be remembered that the method 
of least squares (and with it the method of moments) assumes that 
the form of the curve is known a primi, in which case the method 
gives the means of determining the most probable values of the 
constants involved. When, however, we are dealing 'with a 
mortality experience, we have no a priori right to assume that 
Makeham’s law is strictly applicable; and, if it is not, the 
deviations instead of following the normal law as assumed in the 
theory of least squares, will include systematic deviations due to 
departure from the Makeham law. In these circumstances the 
method of least squares is not strictly applicable, and we are 
therefore justified in allowing other considerations to guide us in 
selection of the constants. 


We may here note that if the exposures are represented by a 
frequency curve, the deaths being recomputed to correspond to the 
graduated exposures, then the value of log c may, in general, be 
calculated from the moments of the exposures and of the recomputed 
deaths. This can readily be done if the exposures are represented 
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by a binomial curve (see Calderon, JJ.A., xxxv, 157/, although 
precautions must be taken so to group data that the number of 
terms in the binomial is not great — not more, say, than five or six ; 
or by the normal frequency curve (see Elderton's “Frequency 
Curves’’, pp. 98-100); or by the curve y — where, if Eq, Ej, 

&c., represent the successive moments for the exposures round the 
origin, and 9q, Oi, &c., the similar moments of the recomputed 
deaths, 

we shall have — / /i \ 

y f 3 _ 

\Eo El/ 

whence, y being known, log^c is easily found. 

The above relation may be thus demonstrated. The force of 
mortality at age a: is assumed to be of the form A + = A + 

= A + Bg^’^, putting X = logeC. Thus the death curve will be of the 
form where the second term is of the same 

form as the first with y - A substituted for y. But by the well- 
knowm properties of the Camma integral (see Williamson’s 
“Integral Calculus”, Art. 120) we have 


( ^ _ ^o'\ 
log^c _ \ Ei Eq / 


/a3 /K 

J 0 zJ 0 


, - XZ -.M - 1 


whence it is easily seen that, writing E'o, E'l . T . for the moments 
of 


Eq = Eq 
E l = Eq X 


Eo = Eo X 
whence 


m+1 


(m + 2)(m 4- 1) 


^o = AEo + BE'o 

6Ii = AE3+BE'o— ^ 
y — A 

ft, = AEo + BE'o^--%ii> 

iy - A)- 


^„-Eo = A + b|-» 

JCio 




= A + B' 


A = B'- 


= A + B' 


y-k 


y ~ k 




^2-fE2 = A + B|-“4!^„ =A + B' 
Eq vy — Aj 


(y - A)^ 


so that 


A-Ai- 


7- A- 

yjzA - yrJ^e 
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If the exposures, as often happens, can only be represented by 
a curve of the form = (^vhere ir represents a propor- 

tionate part of the range of the curve so that ./* ranges between 0 
and l) and if, as before, we represent the successive moments for 

exposures and deaths by 5 Mq, where /;?oandMoare 

made = 1, then writing 

(a + l)-(a + ^ + 2 )Mi = Ro 


(a + l)Mi - (a + /5 + 2)M. = Ri 

it will be found that, putting r for the range of the curve in years 
of age, 

1 !Ro 

r ogcC- 

^ Ri. 

(M3 - Mo) - - Jdo) 


from which as the numerical value of all the quantities except 
logg c and h are known, these two may be easily found. 

This ntay be shown as follows : — 

Let the curve of exposed to risk be represented by the type 

where the entire range of the curve is taken as unity, and assume 
/j, a, and yS to be determined in the usual manner. 

Let the curve of the recomputed deaths be of the form 


+ = + . . . (1) 

i.e., we assume that - (log ?*) = A + 'Bri^ 

’ do: 

As regards the curve .r, we shall have 

log logir + 13 log (1 - x) -p yx 


(h __ /'a 
dx \X 


13 

1-x 



or, multiplying both sides by 

(a + BK+i + 7(»‘+^A+-)]j . . (2) 

. . dx 


t 
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Integrating the left-hand side of this equation hj parts, and 
noting that the factor is zero for the limits 1 and 0, 

J [(?^ 4 - ~ (i^ + l)x^]z'ch = [ap/ - (a + 

that is 

(t + 2Wt+i -(t-h iWt = aon't - (a + /^)m't,+ i -f- yOih.-hi ~ 

and 

(a -f ?^ + l)mt -(a-hjS-i-t-h 2}nit+i + - m!t-{~ 2 ) = 0 . , (3) 

there m't represents the ifth moment of the curve ,r round the 
ordinate ;r = 0. 

If 7 = 0, the curve becomes identical with y, and writing mt 
for the J^th moment of y round the ordinate o: = 0, we haA^e 

(a-j-i54-l)m^-(a + /5 + j? + 2)mm= .... (4) 

Write, as before, the total of the exposed =E(), and of the 
deaths = respectively, and represent the total of the exposed 
multiplied at each age by the factor ey^' by E'o . 

Let Yjt and 6t be the j^th moments of the curve of exposed to 
risk and of the recomputed deaths, the areas of the curves not 
being taken as = 1, but having the values Eo and do al:)Ove defined, 
that is to say, representing the total exposures and the tcftal deaths. 
And let E'^ be the jfth moment of the curve of exposures multiplied 
at each age by 

Then we . have Ot — AEe -f BE'g (5) 

where 9t and Ej are known, but the remaining quantities unknown. 
From (3) and (4) — 

(a ^ 4- l)Ej — (a -4- ^ 4- if 4- 2)Ei-f-i = 0 (6) 

and (a 4" if 4" l)E ^ — (a4"^4"if4~ 2)E ^ 4-1 4* ylE ^ 4 -i — E ^ 4 - 2 ) — 0 . ( / ) 

W^rite (a 4- if 4“ — (a -h ^ 4- if 4“ ~ > 

from ( 5 ) 

(a 4- if 4- 1 ) (AEj 4- BE 35 ) — (a 4- /5 4- if 4- 2 ) (AE^+i 4- BE ^ 4 - 1 ) = 
and from (6) 

(a 4- if 4* l)BE i — (a 4- 4- if 4- 2)BE i4-x ~ 

and from (7) 


By (E t+2 ■” E t+i) = 


( 8 ) 
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Since from (5) 

BE< = 0t — AE{ 

we have 7[(^*i+2- 0(+i) - A(Et+2-E(+i)]=Ej . . . . (9) 

■writing i = 0 aiid if = 1 respectively, we get 


y[(d2-di)-A(E2-Ei)]=Ko 


7[(^3-d2)-A(Es-E2)] = Ki 

whence R'i[(d2 - 6 j) - A(E2 - Ei)] = EoK^s - O^) - A(E3 — E2)] 


and 


also, from (9) 


El(d2-dl)-Eo(d3-02) 

Ei(E2-Ei)-^(Es-E,) 

Kq 

^ “ (^2 - di) - A{E2 - El) 


. . ( 10 ) 


The value of B cannot be determined directly from these equations 
as it enters symmetrically with the values of It is therefore 

necessary, having- found the value of y, to compute the value of E'o 
and thence deduce B from equation ( 5 ). 

Unless the mortality follows Makeham’s law very closely better 
results win be obtained by calculating both E'o and E'l and obtaining 
values of A and B satisfying the equations 

AEo + BE'o = ^ol 

AEi 4 - BE'i = J 



Tables of Values of y 

y ' 

A 


y 

01128 

1128 

•51 

•52924 

02256 

1128 

•52 

•53790 

■03384 1 

1127 

•63 

•54646 

■04511 

1126 

•54 

•55494 

•05637 

i 1125 

•55 

•56332 


84.681 403 


11 -29742 


26570 1063 


34 

•36936 

1002 

35 

•37938 

995 


44 *46623 


•89 

•79184 1 

507 

■90 

•79691 i 

1 

497 

•91 

•80188 

i 489 

•92 

•80677 

479 

*93 

•81156 

471 

•94 

•81627 

462 

•95 

*82089 

453 

•96 

•82542 

445 

*97 

•82987 

436 

•98 

•83423 

428 

•99 

•83851 

419 

1-00 

•84270 

411 
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Table of Values of -^\ e~^‘^.dx — continued. 

0 




A 

- 

y 

A 

- 

y 

A ! 

1*51 

•96728 i 

117 

2*01 ; 

*995525 

195 

2*51 

•9996143 1 

202 i 

1*52 

•96841 i 

111 

2*02 ^ 

•995720 i 

186 

2*52 i 

*9996345 ' 

192 

1*53 

*96952 1 

107 

203 i 

•995906 i 

180 

2-53 1 

•9996537 ' 

183 

1*54 

•97059 ! 

103 

2-04 ; 

*996086 ' 

172 

2*54 1 

•9996720 i 

173 

1*55 

*97162 1 

101 

2-05 : 

*996258 : 

165 

2*55 

■9996893 ! 

165 

1*56 

•97263 1 

97 

2*06 i 

•996423 ^ 

159 

2*56 ' 

•9997058 : 

157 

1*57 

*97360 i 

95 

2*07 1 

*996582 i 

152 

2-57 

*9997215 : 

149 

1*58 

*97455 

91 

2-08 i 

*996734 : 

146 

2*58 ' 

•9997364 : 

141 

1*59 

*97546 

89 

209 I 

*996880 i 

141 

2*59 i 

*9997505 ; 

135 

1*60 

•97635 1 

86 

2*10 1 

! 

•997021 

134 

2-60 i 

•9997640 

127 

1*61 

•97721 

83 

2*11 1 

*997155 

129 

2-61 I 

•9997767 ! 

121 

1*62 

*97804 

80 

2*12 1 

•997284 

123 

2*62 

*9997888 ! 

115 

1*63 

•97884 

78 

2*13 i 

•997407 

118 

2-63 

• 999 S 003 

109 

1*64 

*97962 

76 

2*14 

*997525 

114 

3*64 

•9998112 

103 

1*65 

•98038 

72 

2-15 

*997639 

108 

2*65 

•9998215 

98 

1-66 

•98110 

71 

2*16 

*997747 

104 

2-66 

•9998313 

93 1 

1*67 

*98181 

68 

2*17 

*997851 

100 

2*67 

• 999 S 406 

88 i 

1-68 

*98249 

66 

2*18 

*997951 

95 

2*68 

• 9998491 . 

84 1 

1*69 

*98315 

64 

2*19 

*998046 

91 

2*69 

*9998578 

79 1 

1*70 

•98379 

62 

2*20 

•998137 

87 

2*70 

•9998657 

75 j 

1*71 

*98441 

59 

2*21 

*998224 

84 

2*71 

•9998732 

71 i 

1*72 

• 9838 o 

58 

2*22 

*998308 

80 

2*72 

•9998803 

67 j 

3*73 

*98558 

55 

2*23 

•998388 

76 

2-73 

•9998870 

63 1 

1*74 

• 986 ^ 

54 , 

2*24 

•998464 

73 

2-74 

•9998933 

61 

i 1*75 

•98667 

52 - 

2*25 i 

*998537 

70 

2*75 

•9998994 

57 1 

: 1*76 

•98719 

j 50 

2*26 

*998607 

67 ! 

2*76 

•9999051 

54 i 

: 1*77 

*98769 

! 48 

2*27 

•998674 

64 1 

2*77 

•9999105 

51 ; 

1*78 

' *98817 

47 

2*28 ‘ 

: *998738 

61 i 

2*78 

•9999156 

48 1 

; 1*79 

*98864 

45 

2*29 

, -998799 

58 ^ 

2*79 

•9999204 

46 i 

i 1*80 

*98909 

43 

2*30 j 

i *998857 

55 

2*80 

•9999250 

43 ; 

’ 1*81 

•98952 

42 

2*31 ! 

•998912 

53 

2-81 

•9999293 

41 ^ 

i 1*82 

*98994 

41 

2*32 

•998965 

51 

2*82 

•9999334 

38 : 

j 1*83 

s *99035 

39 

2*33 

*999016 

49 

2*83 

•9999372 

37 : 

1*84 

i *99074 

37 

2*34 

•999065 

46 

2-84 

•9999409 

34 ' 

1 1*85 

; *99111 

36 

2*35 

•999111 

44 

2*85 

•9999443 

33 ’ 

1*86 

i *99147 

35 

2*36 

'999155 

42 

2*86 

•9999476 

31 

1*87 

; *99182 

1 34 

2*37 

•999197 

40 

2*87 

•9999507 

29 

1*88 

*99216 

i 32 

2-38 

•999237 

38 

2*88 

•9999536 

27 

1*89 

•99248 

1 31 

2*39 

•999275 

36 

2*89 

‘9999563 

26 

1*90 

, *99279 

i 30 

2*40 

•999311 

35 

2*90 

•9999589 

109 

1*91 

•99309 

i 29 

2*41 

•999346 

33 

2*95 

•9999698 

81 

1*92 

*99338 

^8 

2*42 

*999379 

3*2 

3*00 

•9999779 

60 

1*93 

; *99366 

26 

2*43 

•999411 

30 

3*05 

•9999839 

45 

1*94 

; *99392 

1 26 

2*44 

•999441 

28 

3*10 

•9999884 

32 

. 1*95 

• *99418 

; 25 

2*45 

•999469 

28 

3*15 

•9999916 

24 1 

1*96 

•99443 

23 

2*46 

*999497 

26 

3*20 

.•9999940 

29 1 

1*97 

*99466 

23 

2*47 

•999523 

24 

3*30 

•9999969 

16 i 

1*98 

: *99489 

! 22 

2*48 

*999547 

24 

3*40 

•9999985 

8 1 

1*99 

: *99511 

i 21 

2*49 

•999571 

22 

3*50 • 

•9999993 

3 1 

2*00 

*99532 

i 20 

2*50 

•999593 

21 

3-60 

•9999996 



Table of 


[The constants are restricted to positive quantities o£ significant value 


Type ■ 

Character of Curve 

Equation y — 

Limits of x 1 

Mean ! 


M:{ 

Shape 

Bange j 

Lower 

U pper 

■ ==/Ai 


I jSymmetrical 

. ! 

i 

Limited 

both 

w'ays 

1 

J-i 

k{a“ — x~) “ 

— a 

1 

-h a 

\ 

0 

rt- 

71 + 1 

0 

II ; Symmetrical 

Un- 

limited 

.r- 

jce ~ (Normal Curve) 

— 00 

+ 00 1 

■ 

0 


0 

m. 

1 

1 

* . J Un- ! 
Symmetrical! 

/t(a2+.r2)-(2+0 

(« > 3) 

1 

— 00 

+ 00 

0 

w — 1 

, 

0 

1 

I 

IV; 

Skew 

Limited 

both 

ways 

k{a — a*)”?’ ■ 1 (<3J -f a*) ~ ^ 

(p + 2 = l) 

— a 

-1- a 

(2-2i)a 

Jf? a- 

?i+ 1 ! 

e i 

1 

16(p- 2)212 
(71 + 1) (ti + 2) * 

V 

Skew 

Limited 

oneway 

m-1 “rt 

kx e 

0 

-h 00 j 

i 

r 

nia 

fl 

77ia'^ 

1 2ma^ 

VI 

Skew 

Limited 
one way 

k(x- o)«!' - ' (a? + a) - 
(2-21 = 1) 

(«> 3 ) 

i 

+ a 

1 

j + 00 

i I 

(2. + 2)a 

1 - : 

J£? = 
^ 2,-1 

16(21 + 2)2)2 
(»fc — 1)(» — 2) 

: VII 

Skew 

Limited 

a 

(»> 3 ) 

0 

+ 00 

a 

a- 

Aa^ 

one way 

n 

«“(« — 1) 

#(7J — 1)(» — 

VIIl 
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Frequency Curves. 


(i.e., all >0), and in Types III, VI, VII and VIII, n must be >3]. 
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